Mathematics for | 
Engineers 
& Applied 
Scientists δον goawec” 


Mathematics for 
Engineers and 
Applied 
Scientists 


STANLEY C. LENNOX 


B.Sc. (Dunelm), M.Sc. (Dunelm), Ph.D. (Edin.), F.I.M.A. 
Senior Lecturer in Engineering Mathematics, 
University of Newcastle upon Tyne 


MARY CHADWICK 


B.Sc. (Liverpool) 
Formerly Lecturer in Mathematics, King’s College, 
University of Durham and 
Lecturer in Mathematics at Norwich City College 


IX 
χὰ 
en 
ἘΣΣΙ 


HEINEMANN 
LONDON 


Heinemann Educational Books Ltd 
22 Bedford Square, London WC1B 3HH 
LONDON EDINBURGH MELBOURNE AUCKLAND 
SINGAPORE KUALALUMPUR NEW DELHI 
IBADAN NAIROBI JOHANNESBURG 
PORTSMOUTH (NH) KINGSTON 


ISBN 0 435 71282 9 en 


© Stanley C. Lennox and Mary Chadwick, 1970, 1977 
First published 1970 
Reprinted 1972, 1974. . 
\Second Edition 1977 
Reprinted 1979; 1981, 1983, 1985, 1987 


eR Sok ateryt Gu . Pry 


Set in 10/11 pt and 9/10 pt Times New Roman 
Printed and bound in Great Britain by 
Biddles Ltd, Guildford and King’s Lynn 


Preface to the Second Edition 


We have been pleased with the success of our book and its adoption 
as a working text by first year students in many engineering courses at 
Universities and higher technological Institutes. Following our own 
ideas, but also in response to requests, we have, in this edition, included 
new material. In Chapter 9 we have added two new sections, one on 
eigenvalues and eigenvectors and the other an introduction to the idea 
of the gradient of a scalar field. Both these topics are often met by 
engineering students during the first or second year courses in their 
own engineering disciplines. In Chapter 11 a new section on isoclines 
has been added and the ideas of the numerical integration of differ- 
ential equations form a new section in Chapter 13. Two other new 
sections of Chapter 13 give an elementary introduction to error analysis. 
Now that all students have electronic pocket calculators it seems to us 
important that the concept of accuracy, propagation of error, and error 
bounds should be dealt with early in the curriculum. Additional 
examples have been added to the relevant Exercises. 

This edition also contains a completely new Chapter 17 on Applica- 
tions. The teacher of mathematics is often asked by engineering students 
to explain why it is that they have to be taught topics such as eigen- 
values, vectors, complex numbers, partial differentiation. Chapter 17 
goes some little way towards showing students that these topics (and 
others) have application in the physical and industrial world. We would 
have liked this chapter to be much larger and to have covered more 
topics but are necessarily restricted by limitation on the size (and price) 
of the book. We will be satisfied if the student finds something of 
interest which will induce further study of mathematics and if the 
teachers of mathematics to engineers can be encouraged to investigate 
further applications themselves. One of us (SCL) is indebted to his 
colleagues at Newcastle upon Tyne for much discussion on the appli- 
cations but we both accept responsibility for any mistakes that have 
appeared. We would like to thank all those readers who have written 
to us with suggestions and also those who have indicated errors in the 
first edition. We hope we have incorporated all corrections and ideas in 
this edition. We would be pleased to have any further comments. 

Finally our thanks are due to our publishers who have allowed us 
to amend the text as we wished and have made an excellent job of 
putting together the new edition and have tried to keep the costing to 
a minimum. 


Stanley C. Lennox Mary Chadwick 
1977 


Preface to the First Edition 


This book is the outcome of many years of teaching mathematics to 
engineering students particularly in the University of Newcastle upon 
Tyne. It is intended to cover the mathematical content of the first year 
to eighteen months of a three year course in a University or Institute 
of higher technological education. For certain disciplines the contents 
will meet the complete mathematical requirements of the entire three 
year course while for others the contents cover the basic requirements 
before proceeding to more advanced mathematical topics. It covers the 
mathematical syllabus for the Part I examination of the Council of 
Engineering Institutions (CEI). We have assumed a knowledge roughly 
equivalent to a pass in A-level single subject Pure and Applied Mathe- 
matics, a Higher National Certificate, or an equivalent qualification. 

Although primarily written for engineering students the text should 
also be of value to science students whose requirements demand a 
manipulative skill in mathematical processes. Where necessary a 
motivation, usually within an engineering context, is used to introduce 
the reader to a new concept before a proof is introduced. Where such 
a proof is omitted it is simply because we believe that its introduction, 
at that point, would unduly delay the development of the subject and 
may well confuse the student. We do not subscribe to the view that the 
engineer or non-mathematical student should never be exposed to 
mathematical rigour but think that at this particular level it is more 
important that he should be able to use and appreciate the place 
of mathematics in his own subject, and to this end it is necessary that 
the student possesses manipulative skill as well asa broad understanding. 
To give an analogy it is not necessary to have a complete and detailed 
knowledge of the workings of an automobile before attempting to 
drive. Indeed an attempt to teach such detail to some would-be drivers 
may well lead them to abandon the whole process in despair. 

In writing the book we have not made any distinction between 
different kinds of engineering student but consider, no matter what 
discipline, that all should have a common mathematical core syllabus. 
It is now accepted that all engineers should be exposed to the mathe- 
matical ideas of probability, statistics, and numerical analysis and an 
important feature of this book is the introduction of these subjects 
alongside the more common curricula subjects of analysis and algebra. 
Chapters 1 to 7 cover analysis, including functions, differential and 
integral calculus, convergence, and Taylor series. Chapters 8 to 11 deal 
with the algebra of complex numbers, linear algebra, elementary vector 
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analysis, and ordinary differential equations. Chapters 12 to 14 are 
devoted to the analytical and numerical solutions of algebraic and 
transcendental equations, the numerical solution of a set of linear 
equations, an introduction to finite differences and polynomial inter- 
polation, and numerical differentiation and integration. Chapters 15 
and 16 provide a basic statistics course covering distributions, proba- 
bility, sampling, and simple tests of significance. 

The book is, however, presented as a whole and it is not intended 
that branches of mathematics be put into different compartments. For 
convenience it is divided into chapters but that does not mean that any 
series of lectures taken from the book need follow the chapter order. 
Certain chapters obviously demand knowledge of the work covered 
previously, for instance it would be difficult to understand Taylor’s 
theorem in Chapter 5 without having read the previous four chapters. 
On the other hand the two chapters on statistics demand a minimum 
of prior knowledge while the linear algebra of Chapter 8 and the vector 
analysis of Chapter 9 may be started after reading Chapter 1. The 
numerical work of Chapters 12, 13, and 14 can be read in isolation 
providing that differentiation and integration is covered before reaching 
the end of Chapter 14. The sketch gives a visual indication of the 
chapters covered by four possible series of lectures and shows the 
linkage points between the series. 


Series 1: 8—-+——_9-—>—__10—-> 11 
Series 2: 1-»- 2 --3..-.- 4  .5-»-- 6-.-»7 
Series 3: 12.----..-- 3 > > 14 
Series 4: 15—-_—+——_16 


This book can be used as working text and to encourage both under- 
standing and skill a large number of examples is given and solved 
within the text. In addition an exercise is presented at the end of each 
chapter. Every example set has been provided with an answer and hints 
are given when it is considered necessary. Most of the examples are 
original but some have been taken from those collected and amended 
over many years of teaching and examining for the Universities of 
Newcastle upon Tyne, Durham, Edinburgh, Glasgow, and London 
together with the Heriot-Watt and Strathclyde Universities. The 
answers to the exercises are collected together at the end of the book 
and while they have all been checked we will be amazed if some are 
not in error. We would be grateful to any reader who is prepared to 
indicate the point of error for future amendment. 

It is inevitable that certain items have had to be omitted and others 
curtailed but a compromise has to be made between various factors 
such as the size of the book, the time devoted to mathematical studies, 
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and the extent of the syllabus. Throughout, however, it has been our 
aim to achieve a satisfactory balance between these factors. 

Finally, we are indebted first to Professor D. V. Lindley and Dr. 
J. C. P. Miller for permission to reproduce extracts from their book 
of Elementary Statistical Tables, and second to Professor L. Maunder 
both for encouraging us to write the book and for providing us with 
many helpful and constructive criticisms. 


Stanley C. Lennox Mary Chadwick 
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Introduction 


11 FUNCTIONS OF ONE VARIABLE 


When two quantities are so related that one of them is determined 
uniquely when the other is known, then the first quantity is a function 
of the second quantity. That one quantity is a simple function of another 
does not necessarily mean that there is a formula connecting them. A 
function can be expressed as a table of values, in the form of a graph, 
as a formula, and in other ways. 


1.1.1 Notation and definitions 


When two variables x and y are related so that y is determined uniquely 
when x is known, the functional relationship is written y = f(x) where 
J(x) is the rule prescribing the value of y for any given value of x. Other 
notations such as y = g(x), y = (x), y = F(x), $(x, y) = 0 will be 
used, and y = y(x), x = x(t) are used to mean that y is a function of 
x and x is a function of ἢ. 


‘aj aa a<x<b eb 


(b) ας a eb 
Cae geese ΕΗ 
Fig. 1.1 


Suppose y = f(x), then the variable x is called the argument or 
independent variable and f(x) or y the dependent variable or value of 
J for the argument x. Values of y = f(x) are found for any given value 
of x by replacing x in f(x) by the given value. Thus the value of f(x) 
when x = 2 is denoted by f(2). 

The set of numbers over which x may vary is called the range of x 
and is usually the set of points in an interval on the x-axis. This interval 
may be the whole line from —oo to +00. The set of all real numbers 
x between two fixed numbers ὦ and b, written a << x < ὃ means that 
x may take any value which is simultaneously greater than a and less 
than ὁ. This set of numbers is called an open interval. See Fig. 1.1(a). 
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The correct description of a mathematical real number requires more 
precise detail than is given here. It is sufficient to recognize that a real 
number is either a rational number of the form p/g, where p and q are 
positive or negative integers and g + 0, e.g. 0, 4, —1, +2, —#; or an 
irrational number that cannot be expressed in the form p/q, e.g. +/2, 
m, ~/(—4). To each and every point of the x-axis there corresponds 
a rational or irrational number x, called simply a real number. 

The set a <x < ὃ of real numbers x consists of all numbers in the 
open interval plus the left end point x = a. See Fig. 1.1(b). The set 
a<x <b is the set of all real numbers between a and ὁ including 
both end points. This is called a closed interval. See Fig. 1.1(c). If x 
may take all values in a range it is called a continuous variable. 

The functional relationship y = f(x) now determines the set of values 
y may take. In general the function may be represented graphically by 
those points whose coordinates (x, y) satisfy y = f(x) for all those 
values which x may take. 


Example (i) y = x/(x?+ 1). 
Here x may be any real number and the graph of the function is 
sketched in Fig. 1.2. 


Fig. 1.2 


Example (ii) y = ν᾿ (x — 1). 
The range of values of x consists in those values for which the function 
has a meaning, so that in this case, x > 1. See Fig. 1.3. 


Example (11) Let f(x) be the sum of x ternis of the geometric series 


l+e+e+ tt... 


Graphically y = f(x) is represented by a set of points which may not . 
be joined by a continuous line. Values of x are positive integers only, 
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and the value of the function for non-integral values of x is meaningless. 
The successive sums f(1), f(2), f(3), . . . are illustrated in Fig. 1.4. 


Fig. 1.3 


2 3 

Fig. 1.4 
Example (iv) It is possible to define a function by different formulae for 
different values of x, as 


V(x — 1) if 1<x<5 
(ORs ie πὸ if x5, 
y 


Fig. 1.5 


4 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


This function is defined for all values of x > 1 except for x = 5. Its 
graph is sketched in Fig. 1.5. 

A most important property of the function f(x) is that to each value 
of x in the range there corresponds one and only one value of αὶ Thus 
the equation y? =x represents the two functions y = 1/x and 
y = —+/x. The expression +/x will always mean the positive square 
root of x when x is real and positive. 


Example (v) The relationship y = 4/x? defines y as a single-valued 
function of x for all values of x. It is also called the absolute value of 
x or modulus of x or mod x, written |x|. Thus 


es _ fx if x>0 
ΝΟ ΈΡΊΣΙΞ ΩΝ το 


Geometrically, the function |x| is the distance from the origin O to the 
point P which represents the number x on the scale of real numbers 
regardless of whether x is positive or negative. This is shown in Fig. 
1.6(a) where the distance OP, is |x,| and the distance OP, is [χα]. 


Po-t--|2c}--- »O-«---~4 204} ---~- >A, 
x2 ay x 
(a) 
<|oc-al-+ 
a 5: ϑ - 5 τ... 
a-2 a so a+2 x 
(b) 
Fig. 1.6 


The modulus symbol may be used to express a range of values of x, 
for example the range —1 < x < 1 is equivalent to |x| < 1. 

Also, |x — αἱ < 2 means that x is within two units of a. This means 
that x may vary between a — 2 and a + 2, so that |x — αἰ <2 is 
the same as a — 2 < x <a +2. This is shown in Fig. 1.6(b). 


1.1.2 Limits, continuity 


Consider a given function f(x) at some particular value of x = a. As 
x approaches the value a, the value of f(x) will usually change steadily 
and approach the value f(a), and it is possible to choose a value of f(x) 
‘as near as we please’ to the value f(a) by choosing x sufficiently near 
to the value a. A different way of saying this is that f(x) tends to f(a) 
as x tends to a. This is written in symbols, 


foo) ->f@ as x—a. 


INTRODUCTION 5 
The value of f(a) is the limit of f(x) as x > a, or 


Lim fx) = f(a. 


If x increases towards the value a, or x — a from the left, the notation 
i@—-f@—90) as x->a-— 
or 


Lim f(x) = fla — 0) 


is used. Similarly if x decreases towards the value a, or x approaches a 
from the right, 


fo—>f@t+0) as x-at 
or 


Lim f(x) = fla + 0). 


Often it does not matter whether x approaches the fixed value a 
from the left or from the right since in both cases the limit will be f(a), 
the value of the function at x = a. For example, consider the graph in 
Fig. 1.5. If the value of a is between 1 and 5 or greater than 5, 


f(a — 9) = f@ = fla + 9). (1) 


When equations (1) are satisfied, the function is said to be continuous 
at the point x = a. For continuity at the point x = a it is not sufficient 
that the limits of f(x) as x tends to a from the left and right should 
both exist and be equal, for the common value may not be f(a). For 
example, consider the function defined by 


f(x) =Owhenx 40, f(x) = 1 whenx =0. 
Then 


Lim f(x) = Lim f(x) = 0, 
z70- ΡΨ 


but this limit is not equal to 710) which is 1, and the function is not 
continuous at x = 0. 
Consider the function sketched in Fig. 1.5. For this function, 
Lim f(x) = 2, Lim f(x) = 5, 
:--5- 1..0͵5δ51 
i.e. equations (1) are not satisfied at the point x = 5 and the function 
is not continuous at this point. There is a ‘jump’ in the value of the 
function as x increases through the value 5. 
Consider also the way different functions behave when the argument 
x increases indefinitely. If the value of the function increases indefinitely 
the function is said to ‘tend to infinity’ as x tends to infinity, 1.6. 
1 )-» ὦ as x > οὐ. 


Fig. 1.8 
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Example (i) Consider the function f(x) = x*. For all values of x this 
function is continuous and as x > +0, f(x)— ©. The graph of 
y = χϑ is sketched in Fig. 1.7. 

The graphs of y = x?” where n is a positive integer are similar to that 
of y = x*. 


Example (ii) Consider the function f(x) = x°. This function is continu- 
ous for all values of x and as x > +00, x3 +00, as x > —@, 
x? —» —o. The graph of y = x? is sketched in Fig. 1.8. 

The graphs of y = x°"*1 for positive integers n are similar to that of 
y= x8, 


Fig. 1.9 Fig. 1.10 


Example (iii) The function f(x) = 1/x is defined and continuous for 
all values of x except x = 0. This example demonstrates two properties 
not mentioned so far; 


(a) asx—>04+, Κ.-» +0, 
asx—>0-, f(x) -—-o, 


i.e. the ‘jump’ in the value of y = f(x) as x increases through x = 0 is 
infinite; 


(b) as x —» οὐ, 1/x — 0, yet zero is not a value of f(x), i.e. the limit of 
a function is not necessarily a value of the function, y = 1/x is never 
zero. 

Similarly the function f(x) = 1/x? is defined and continuous for all 
values of x except x = 0. For this function, f(x) > +00 as x > 0+ 
and as x + 0—. See Fig. 1.9 and 1.10. 


8 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


Example (iv) Consider the function f(x) = [x] where [x] is defined as 
the integral part of x. Thus [3:2] = 3, [—4-2] = —5 and when x is 
an integer, [x] = x. 

This is an example of a step function and at each integer value of x 
there is a point where the graph is not continuous. This function is 
illustrated in Fig. 1.11. 


Fig. 1.11 


Example (v) Consider the function f(x) defined by 


_ {x when 0<x<1l 
70..Ξ 2~x when 1<x<2, 


1 
Fig. 1.12 
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Here y = f(x) is defined only over the interval 0 < x < 2 and 
Τί -- ΟΟ Ξ-χῇὴ Ξ: ΧΙ - 0) ΞΞ 1. 


Hence equations (1) are satisfied and the function is continuous at the 
point x = 1. See Fig. 1.12. 

A function which is continuous at all points in a particular range is 
a continuous function in that range. Examples (i) and (ii) are examples 
of functions which are continuous for all values of x. The functions 
1/x, 1/x? of Example (iii) are continuous for all values of x except 
x = 0; the function defined in Example (v) is continuous for all values 
of x for which the function is defined, 1.6. in Ὁ < x < 2; the function 
[x] of Example (iv) is not continuous at any integer value of x. 


1.1.3 Odd and even functions 


The function f(x) is an even function if f(—x) = f(x) for all values of x. 
The graph of an even function is symmetrical about the y-axis, as x?, 
1/x?, cos x. 

The function f(x) is an odd function if f(—x) = —f(x) for all values 
of x. The graph of an odd function is symmetrical about the origin, 
as x°, 1/x, sin x. 

Any function can be written as the sum of an odd and an even 
function, for 


70) = 100 + f(-9} + HC) — f(—»}. 


1.1.4 Monotonic functions 


The function f(x) is a monotonically increasing function, or f(x) increases 
monotonically in an interval of values of x if f(x) increases steadily as 
x increases through the interval, i.e. if f(x2) > Τα whenever x, > x}. 

Similarly f(x) is a monotonically decreasing function if f(x) decreases 
steadily as x increases through an interval, i.e. if f(x.) <_f(x1) whenever 
Χ > X4. 


1.2 INVERSE FUNCTIONS 


Suppose y = f(x) is a continuous function of x, then it is sometimes 
possible to express x as a function of y. For example, the formula 
y = mx +c determines y uniquely when x is given, so that y is a 
function of x from the definition of function given in section 1.1. Also 
x = (y — c)/m is a function of y. 

Consider the function given by y = x?; y is determined uniquely 
when x is given, for all values of x. If x is expressed in terms of y there 
are two possibilities, x = + +/y and x is not uniquely determined when 
y is given. Hence from the definition of a function, x cannot be expressed 
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as a function of y but is expressed as two functions of y for all positive 
values of y. 

If a given function y = f(x) can be replaced by a function x = ¢(y), 
then x = ()) is called the inverse function of y = f(x). For this to be 
possible, to each value of x there is one and only one corresponding 
value of y and to each value of y there is one and only one corresponding 
value of x. 


Example The function y = x?, -2 <x <2, has no unique inverse 
function since x = +1/y gives two functions in the range. 

The function y = x?, 0 <x <2, has the unique inverse function 
x = +/y. These functions are sketched in Fig. 1.13 and 1.14. 


Fig. 1.15 


Suppose y = f(x) is defined in the range a < x < ὃ and has a graph 
as in Fig. 1.15. For each value of x in the range y is uniquely determined 
but to each value of y there may be more than one corresponding 
value of x. Hence this function cannot be inverted in the whole range 
axx<b. 
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In the range a < x < 6, x is uniquely determined when y is given, 
f(x) is monotonically increasing and hence x can be expressed as a 
function of y ina < x < ς. Similarly x can be expressed as a (different) 
function of y in the range ὁ < x <d where the function is mono- 
tonically decreasing and in the range ὦ < x < δ where the function 
is monotonically increasing. 

More generally if y = f(x) is continuous and monotonic in an 
interval then it is possible to define the inverse function x = 4(y). 
Consider the graphs of y = f(x) and x = $(y) where y = f(x) is con- 
tinuous and monotonically increasing ina <x <b, anda<y<f. 
These graphs are sketched in Figs. 1.16 (a) and (b). 


Fig. 1.16 


Geometrically the graph of the inverse function is drawn by reflecting 
the graph of y = f(x) in the line y = x, or by ‘rotating’ the axes of 
the graph of y = f(x) through 90° and reflecting in the new x-axis. 


1.2.1 The inverse circular functions 


The function y = sin? x or y = arc sin x is defined as the value of y 
such that 


x=siny and --ἰπ-ν «- ἐπ. (2) 


That is the function x = sin y is inverted only in that range where 
there is a one to one correspondence between values of x and values 
of y. There are many ranges for y where this is possible. The range 
chosen in equation (2) is that one which includes y = 0. 

Note that if x = sin y there are many values of y which satisfy the 
equation for a given value of x. For example if sin y = 4, then 
) ΞΕ ῆπ + (—1)"o7, 2 =0, ΕἸ, 2,.. .. But sin-1(4) = ἐπ. Figure 
1.17 shows sketch graphs of x = sin y and y = βίη ἷ x. 

Similarly the expression x = cos y is many-valued and to obtain 
the full range of values of cos y once and once only, the range is 
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(a) (b) 
Fig. 1.17 


restricted to Ὁ <y <7 where the function cos y is monotonically 
decreasing, i.e. y = cos~1 x or y = arccos x is defined to be that value 
of y such that 


x=cosy and 0 «ν «π. (3) 


Figure 1.18 shows the graphs of y = cos~? x and x = Cos y. 


y 


y= cos"! x 


(a) (b) 
Fig. 1.18 


The expression x = tan y is many-valued but takes all values once 
only in the range --π < y < 47 where it is monotonically increasing. 
Hence the inverse function y = tan7? x or y = arctan x means 


x=tany and --ῸᾷτἮἧτ «ν «- ἐπ. (4) 


Figure 1.19 shows the graph of y = tan7? x. 
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y=tan oc or xc=tany 


mee ee ee ee = ats “ὠς νει προς ων ες το σὰ 
2 ΠΤ 


Fig. 1.19 


The difference between βίη 1 x and (sin x)~! = cosec x should be 
noted. 


Example Prove that tan~! 2 + tan~! 3 = 37. Is it true that tan-+ 2 + 
tan"? 3 = tan" (—1)? 
Solution Let tan-+2 = a, tan"? 3 = 8, then 
tana+tanB 243 : 
BT tramp α τὸ πὰ ῶ 
Now by definition of the inverse tangent, —47 <a « ἐπ and 
— 7 < β < ἐπ. Moreover since tan « > 1 and tan 8 > 1 both « and 
B are greater than 2π, ie. dr <a < ἐπ and }7 < B < ἐπ hence 
ἐπ <a + β <7, and this with equation (i) means « + β = ὃπ. 
By definition, tan-'(—1) = —}m so that tan-12+tan135 
tan-+(—1). 


1.3 COORDINATE SYSTEMS 
1.3.1 Coordinate systems in the plane 


The position of a point P in the plane is fixed when its rectangular 
Cartesian coordinates (x, y) are known. These are coordinates referred 
to two perpendicular axes Ox, Oy through an arbitrarily chosen point 
O called the origin. The x-coordinate is called the abscissa and the 
y-coordinate the ordinate of the point. 

The position of the point P in the plane is also fixed if its distance r 
from a fixed point O in the plane and the angle @ the line OP makes 
with a fixed straight line in the plane are known. The point O is called 
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the pole, the fixed straight line is called the initial line. The distance r 
is always positive and the angle is measured positively, or anti-clock- 
wise, from the initial line so that the direction is specified by the angle. 
The point P is then given the polar coordinates (r, 6). 

Polar and Cartesian coordinates are transformed one to the other if 
the origin is used as the pole and the positive x-axis is the initial line 
by the equations 


x=rcos#, y=rsiné, 
These equations may also be expressed. 
r= V(x? + y?), cos 9 = x/r, sin 6 = ylr. 


See Fig. 1.20. It is not sufficient to consider tan 6 = y/x since the 
value of 6 is not then uniquely determined. For example, if both the 
x- and y-coordinates are negative the point is in the third quadrant, 
yet tan 6 is positive giving the point (r, θ) in either the first or the third 
quadrant. A rough sketch will often prove useful. 


Example (i) The point P whose Cartesian coordinates are (1, 1/3) has 
polar coordinates (2, 47). 


Example (ii) The point Q whose Cartesian coordinates are (—1, \/3) 
has polar coordinates (2, $7). See Fig. 1.21. 


a (2, ξ πὶ P (2,47) 


Initial tine 


Fig. 1.20 


1.3.2 Coordinates in three dimensions 


(i) Cartesian coordinates A point P in three-dimensional space is 
specified by the coordinates (x, y, z) referred to three mutually perpen- 
dicular axes through an origin O. The set of numbers specifying the 
coordinates of P may also be written (x1, Χο, X3). See Fig. 1.22. 


(ii) Cylindrical polar coordinates In three dimensions the point P is 
fixed when the cylindrical polar coordinates (p, ᾧ, z) are known, where 
z is the perpendicular distance of P from the plane (xy) and (ρ, ¢) are 
the polar coordinates of the projection N of P in the (xy) plane. See 
Fig. 1.23. 
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The relations connecting cylindrical polar coordinates with Cartesian 
coordinates are 


x = pcos ¢, 
y =psing, 
z= Ζ. 


Ζ 


Fig. 1.22 Cartesian coordinates 


P(p,$,2z) 
4 


Fig. 1.23 Cylindrical polar coordinates 


(iii) Spherical polar coordinates The point P in three-dimensional 
space is fixed by the spherical polar coordinates (r, 0, ¢), where r is 
the distance of P from the origin O, θ is the angle made by OP with 
the positive z-axis and ¢ is the angle made by the projection ON of 
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OP on to the (xy) plane with the positive x-axis. The transformation 
from (x, y, z) coordinates to (r, 8, 6) coordinates is given by 


x=rsin@cos¢, y=rsinOsing, z=rcos 6. 


Note that ON = r sin 6 = p. See Fig. 1.24. 


Fig. 1.24 Spherical polar coordinates 


1.3.3 Coordinates in more than three dimensions 


The set of numbers (x, y, z) or (x1, X2, X3) determines the position of 
a point in space referred to rectangular axes through an arbitrary origin. 
Suppose the point is specified by a fourth variable also, then the set of 
numbers (x1, X2, Xs, X4) will fix the point completely. This set may be 
written more shortly as (x,), where i = 1, 2, 3, 4. The fourth coordinate 
could, for example, be the time f. 

It may be that a set of m numbers is required to describe a point 
uniquely. Then the set of numbers (%1, X2, X3,.. .» 8) OF (x), 
i= 1,2,3,.. ., describes the point completely. 


1.4 FUNCTIONS OF TWO VARIABLES 


When three quantities are so related that one of them is uniquely 
determined when the other two are known, the first quantity is a 
function of the other two quantities. For example the volume of a 
cylinder is a function of its radius and height given by the formula 
V = πρῶ". The variables r and ἢ are independent; if one depends on 
the other, then V is a function of one variable only. 

Consider the function ὦ = f(x, y), where x and y are independent 
variables, i.e. ὦ is uniquely determined when x and y are given, and is 
defined over some range R of values of (x, y). If a geometrical interpre- 
tation of w = f(x, y) is required, then the number pair (x, y) may be 
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represented by a point N of rectangular coordinates (x, y) in the region 
R of the xy-plane. The value of the function ὦ or f(x, y) is then repre- 
sented by a line NP drawn perpendicular to this plane at the point N 
of length equal to f(x, y). Thus as N varies over the region R the 
corresponding point ὦ = f(x, y) varies over a surface consisting of all 
points P with Cartesian coordinates (x, y, f(x, y)). 


15 RULES FOR INEQUALITIES 

(Ὁ Ifa <bandb<cthena<c, 

(ii) If a<b then a+x<b+-~x for every x, ie. the inequality 
relationship is unchanged when the same number (positive or negative) 
is added to both sides. 

(iii) If a < b then pa < pb when p is positive and ga > gb when q is 
negative. In particular if a << ὃ then —a > —b. This is easily seen if 
a and b are considered as points on the x-axis. 


Fig. 1.25 


In Fig. 1.25 the inequality a < 6 means that a is to the left of 6 and 
—a is to the right of —b on the line in which numbers increase from 
left to right. 


For example, 3<4, —-3> —-4; 

6 < 8, —3/2> —4/2. 
Thus the inequality is unchanged if both sides are multiplied by a 
positive quantity; but the inequality is changed when both sides are 
multiplied by a negative quantity. 
(iv) If a < ὃ, and ab is positive, then 1/a > 1/b. 
For example, 3<4 and 4>4; 

—3<-2 and -ἐ5 - 


EXERCISE 1 
1 Find the values of f(2), f(—1), fU/x), f(x + h) when f(x) = x? + 1/x. 
2 Find the values of /(0), f(37), f(37/2) when f(x) = sin x + cos x. 
3 Solve the following equations for x in terms of y and discuss the 
possible ranges of x and y. 


: χες τὸς 

ὦ γπτττ: (ἢ x? + xy + y? Ξ 3; 
x? 

x? — 1 


(ii) yy = 


(iv) y= 


χ--Τ' 
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4 Investigate the continuity of the following functions at x = 0, and 
draw a rough sketch of the graph of y = 1) in each case: 
@ fo) = |x]; Gi) fo) = 1G? + 2x); 
ἘΞ Ὁ when χεῦ 
IQ) (i when x>0; 
: x when x <0 
(iv) fle) = [ —x when x>0. 


5 Show that the function x + [x] is not continuous at any integer 
value of x and draw a rough sketch of the graph of y = x + [x]. 


6 (i) Sketch the graph of y = 1/(x — 1). 
(ii) Sketch the graph of y = f(x) where f(x) is defined by 


ve x—1 when O<x<2 

i= 3—x when 2<x « 3. 

7 Show that (i) x? < 1 is equivalent to |x| <1; 
(ii) if x? > 1 then x > lorx < —1. 


8 Describe the range of the variable x without the symbol ‘modulus’ 
in each of the following: 


ὦ |x| <3; (iii) |x — 2] <4; 
(ii) |x| > 3; (iv) |x — 3| <3 and at the same time 
|x -- 4| >2. 


9 Given that 0 = sin-! (—}4), find the values of cos 0, tan 6, sec 6, 
cosec Ὁ. 


10 Given that 6 = cos-! (—4), find the values of sin 6, tan 0, sec 6, 
cosec 6. 


11 Evaluate (i) sin~? (1) — sin-1(—1); 
Gi) (άπ: (1) — tan-?(—1); 
(iii) 560" (2) — sec-+ (—2). 

12 Describe and sketch the locus of points P whose rectangular 
Cartesian coordinates are (x, y, z) and which satisfy the following 
pairs of simultaneous equations: 

(i) x = constant, y = constant; (iii) x? + y? = 9, z= —3; 
Gi) y= x, z= 4; (iv) y = 0, x?/a? + z37/b? = 1. 
13 Describe and sketch the locus of points P whose cylindrical polar 


coordinates (p,¢,z) satisfy the following pairs of simultaneous 
equations: 


@p~=3,z=2; (ὃ d= t,z=p. 
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14 Describe and sketch the locus of points P whose spherical polar 
coordinates (r, 9,¢) satisfy the following pairs of simultaneous 
equations: 

ὦ r=5,¢= 4; (δ $ = 40, 6 =r; 
Gi) r=4,0= hb; (iv) 6 = ἐπ,γ = 408 8. 

15 Transform each of the following equations from the given co- 
ordinate system into forms which are appropriate to the other two 
systems: 

(Ὁ 2+ y+ 227 = 4; (iii) z? = p?; 
(ii) x? + y? + 2? = 9z; (iv) r = 4cos 8. 


2 
Differentiation 


2.1 DEFINITION OF THE DERIVATIVE 


2.1.1 Formal definition 


Let y = f(x) be defined and continuous for all values of x in the interval 
a<x < band let x, be a value of x in this interval. Then the derivative 
or differential coefficient of the function at xo is defined as 


imfta + ἢ το fle) is 
rer 0 


when this limit exists. It is denoted by ᾿ Χο). At each point x. where the 
limit is finite the function is said to be differentiable at the point. When 
the derivative exists over the whole range of values of x the function is 
differentiable in that range. 

The value x» may be any allowable value of x, consequently it is usual 
to write x instead of xo in equation (1) and define the derivative of f as 


ΤῸ = Lim fet A= fo (2) 


remembering that x is to be sali constant, while ἢ varies and approaches 
zero. 
Example Find from first principles the derivative of 
70) = χ — 3x + 3, 
where — οὐ <x « ©, 
Solution f~at+A=%+ Ah? —-344+4)4+3 
so that f(x + h) — f(x) = 3x°h + 3xh? + h8 — 3h 


OS Ba LE = (3x? — 3) + h(3x + A). 


As h — 0, 3x? — 3 is held constant while A(3x + ἢ) > 0 and 
ΤΟ.) = 3x? — 3. 


and 


2.1.2 Geometrical interpretation 
Let P(xo, Χο) be any point on the curve y = f(x) and 


Oxo + h, flo + h)) 
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be another point on the same curve. Then in Fig. 2.1 the slope of the 
line joining the points P and Ὁ is QR/PR where 


QR - 30. + » — SX). 
PR 


If the value of this quotient approaches a finite limit as Q — P along 
the curve, P being held fixed, then the slope of PQ tends to the slope 
of a line through P which is defined to be the tangent at P. Then 


fx + - fe) 
fo Tepe 


is the slope of the tangent to the curve at P, or more briefly, the slope 
of the curve at P. 


A aie 


ο 
ᾧ 
+ 
> 
a 


Fig. 2.1 


2.1.3 Notation 


It is sometimes convenient to denote an increment in the value of x 
by 6x (= A), an increment of the function f by df or of y by dy, so that 


Of(Xo) = f(Xo + 6x) — f(xo); (3) 


and substituting in equation (1) 


ieee’) τς ὅδ) 
S'(%o) = at oe (4) 
= oy 
= tin (5) ϑ 
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The derivative of y = f(x) when it exists is a function of x and is 
written f’(x) or γ΄. From the right hand side of equations (4) and (5) is 
derived the notation 


d dy 
ae f@ or a 


where xp is replaced by the more general x. The quotient form dy/dx 
derives from equation (5) and is a limit of a quotient. It is not dy + dx. 
The symbol d/dx, regarded as an operator which when applied to f(x) 
gives f’(x), is sometimes written D or D, and must operate on some 
function of x. Hence Dy means dy/dx. Thus x Dy and y Dx are entirely 
different, for x Dy = x dy/dx and y Dx = yd(x)/dx = y. 

The limit f’(x — 0) if it exists is called the derivative on the left, and 
similarly f’(x + 0) is called the derivative on the right. 


2.1.4 Continuity 

Continuity of the function f(x) is necessary for the existence of f’(x) 
since the limit in equation (2) cannot exist unless f(x + ἢ) > f(x) as 
h - 0, but this condition is not sufficient. For if f’(x — 0) and f’(x + 0) 
both exist and are unequal, then the derivative f(x) does not exist. 


Example Consider the differentiability of the function f(x) = |x — 1] 
at the point where x = 1. 


Solution For the graph of y = |x — 1 see Fig. 2.2. 


Fig. 2.2 


The function is continuous for all values of x including x = 1 but 
fd — 0) = —1 and ΣᾺ + 0) = 1; ie. the derivative at x = 1 does 
not exist and the function is continuous but not differentiable at x = 1. 


2.1.5 Higher derivatives 


If f'(x) exists then it is a function of x which may or may not be 
differentiated. If f’(x) has a derivative it is denoted by f’(x) or d*y/dx? 
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or D?y and it is the second order derivative of y = f(x). Similarly third, 
fourth, up to nth order derivatives may be defined: 
If y = f(x), then 


7 ἄν ὦ = d d™-+y 
and Dry or τα =f w=5(S): 


2.2 STANDARD DERIVATIVES AND RULES 


2.2.1 The derivative of a constant 
Let f(x) = c where c is a constant, then f(x + A) = c and 


fle + A) — f(x) =0. 


Hence f@= μενα +9 -- ΔὉ = 0, 


h>O 
| 1.6. the derivative of a constant is zero. 
2.2.2 The derivative of x" 
Let f(x) = x" where n is any positive integer, then 
Seth = + A” = χ' + nxt th + ἢ (terms in x and A). 
If h £0, {{ὰ + A) — (Οὐ = ηχ" ᾿ + ἢ (terms in x and A). Now 
let ἢ -- 0, then f’(x) = nx""?. 
2.2.3 The derivative of sin x, cos x 


(i) £ (sin x) = cos x. 


From the definition, 


sin(x +h) —sinx 2cos (x + ἐδ) sin $h 


h h 
sin $h 
= cos (x + $A) τ : (6) 
Since cos (x + 11) > cos x as h -- 0 and since 
. sin 8 sin $h 
1 πὸ Π Ξε 1, Τῇ -»1 as h-O 
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then the limit as ἢ -> Ὁ of equation (6), or the derivative of sin x, is 
COs x. 


. da 
(i) Te (cos x) = — sin x. 


From the definition, 
cos(x +h)—cosx  —2sin (x + 4h) sin th 
h ἮΝ h 
th 


—+>-—sinx.1 as h—O. 


— sin (x + Μὴ 


2.2.4 The derivative of cf(x) 


If f(x) is a differentiable function of x and c is a constant, then 
d : 
K(f} = of". 


Let d(x) = ef(x), then for h 40 
He +h — $0) «Ὁ +h -- fe) 
h h 


f(x +h) -- Ὁ) 
: | h | 
Let h + 0, then f'(x) = Οὔ Οὐ. 


2.2.5 Differentiation of sums 


The derivative of the sum of a finite number of differentiable functions 
of x is equal to the sum of their derivatives, and more generally if f(x) 
and g(x) are differentiable and a and ὁ are constants, 


d , 

ἧς (40) + bed} = φῦ) + bg’). 
This is easily proved using section 2.2.4 and the definition of a deriva- 
tive, and can be extended to a linear function of any number of differenti- 
able functions. 
Example SQ) = x8 + 7x? — 5x +4 

d d d d 
ω ΕΞ ΩΝ 9 paste, 2. patio ates 
(O=FEOM+756)-5E0O+ EO 


= 3x? + 14x — 5. 
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2.2.6 Differentiation of products 


The derivative of the product of two differentiable functions f(x) and 
g(x) is f’(x)g(x) + f(xg’(x). For let d(x) = f(x)g(x) and A τῇ 0 then, 


(x + ἢ) — d(x) 
h 
_ fae +h) gx + A) —f) g@ 
- h 
Κα +h) gx +h) -- ὰ τ Ἦ ἢ 50) + [ἃ +A eg) — [ΟἹ 8) 
7 h 


EE DS s0)] fas ἘΝ 0} 


= f(x + h) + g(x) 
Since f(x) and g(x) are differentiable and hence continuous, as h — 0, 
it follows that 

$'(x) = (λε΄ Οὐ) + gf’). 


2.2.7 The derivative of the reciprocal 1/g(x) 


The derivative of the reciprocal of g(x) is —g’(x)/{g(x)}* at a point 
where g(x) 0, for, let d(x) = 1 ae h #0, then 


φα + ἢ — $) _ Ι τα] 
(x + A) g(x) 
= ees + ἢ) — ΟῚ} 
τς φᾷ + Adg(x) 
__-! ete +h) - [6 
g(x + h)g(x) h : 
το 
eis PO) = Κ᾽ 


The proofs of the formulae in sections 2.2.6 and 2.2.7 depend on the 
assumption that the limit of a product (or quotient) is the product (or 
quotient) of the limits. 


Example Show that the derivative of x" is nx"~! when n is a negative 
integer. 


Solution Let n = —m where m is a positive integer, then 
1 


= χῖ = yom eS .-, 
Yas ee ee 


Applying the above result and using section 2.2.2, gives 


dy — —mx™? 
dx xam 
= (=m)x-- 


= nxn} 
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Hence 


= χὸ = nx. 


dx 


2.2.8 Differentiation of a quotient 


The derivative of the quotient of two differentiable functions f(x) and 
g(x) follows by applying the formulae in sections 2.2.6 and 2.2.7 for, 
let 


20) 1 
ΞΖ ----ὄ- --- x 
PO) = Gy Δ χρῷ 
then at a point where g(x) τῇ 0, 
κα, 80) [Ὁ) -- ΠΟ) 50) 
Example (i) © (tan x) = sec® 
xample (i) ἡς (tan x) = sec? x 
: sin x 
since tanx = 
cos x 
‘ iis oe CORO oe Sines 
i ee (cos x)? 
= sec? x. 
Similarly (cot x)’ = — cosec? x. 
Example (ii) 4 (sec x) = sec x tan x 
since sec x = 1/cos x 
(sec x)’ = (cos x)? using section 2.2.7 
= sec x tan x. 
Similarly (cosec x)’ = — cosec x cot x. 


2.2.9 Differentiation of a function of a function 


Suppose fis a function of g and g is a function of x then fis a function 
of a function of x and 


[fle] = οὐ () 


at a point x where g(x) is differentiable and fis differentiable at g(x). 
The symbol f’{g(x)} means the derivative of the function f with respect 
to the bracket {} or g(x), at g(x). 
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Writing g(x) = ¢, then y = f(¢) and equation (7) may be written 


dy ἀν dt 
dx dt de ®) 
For if dx #0, δι = g(x + Ox) — g(x), 
dy = f(t + δὴ — f(), 
ὃν ὃν δι. 
and Fee Var if é6¢ 40 
or oy = if é6¢=0. 
Ox 
If 6¢ 4 0, as dx > 0 
dy _ dat 
dx  adtdx 
If δὶ = 0 but dx #0 
ot dy 
ὃχ ΞΞ and dx = 0. 


A rigorous proof of this may be found in a more advanced text-book. 
It follows that the derivative of f(ax + 5), a and ὁ are constants, is 


af'(ax + δ). (9) 

Example (i) If y = cos (3x + 2), 
then y = —3 sin (3x + 2). 
Example (ii) If f(t) is an arbitrary differentiable function and g(x) is 
such that 

y = flg(x)] = x 
then (8) becomes 

1 = (dx/dt)(at/dx) 
ie. dt/dx = 1/(dx/dt), provided dx/dt γε 0 
(see also section 2.6). 
Example (iii) If y = tan? x = (tan x)’, 
then y’ = 2 (tan x) sec? x. 
Example (iv) If y = cos? (sin 3x) = {cos (sin 3x)}’, 
then y’ = 2{cos (sin 3x)}[— sin (sin 3x)] 3 cos 3x 


using equation (8) repeatedly. 


2.2.10 Parametric differentiation 
When y = f(x) and x and y are both functions of a single variable, 
say t, called a parameter, ie. x = x(t), y = y(t), and x and y are 
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differentiable then, 
dy ὦν αὶ 
dx dx|dt 
This follows from (8) and Example (ii) of section 2.2.9 and is useful 
when a curve is given parametrically in the form 


x=x(t) y=y(t). 
Example Find dy/dx and d*y/dx? as functions of t when 


provided S #0. (10) 


x =a(t—sint), y=a(l —cos?). 


Solution x = a(t — sin ἢ) 
dx/dt = a(1 — cost) 
and y = a(l — οοβ ἢ 
dy/dt = asin t, 
and hence, wy = ee 
asin t 
~ a(1 — cos ἢ 
sin t 
~1T—cost 


provided 1 — cost £0. 
This may be simplified using the half-angle formulae of trigonometry 


= dy _ 2sin $tcos ft _ 


τ τη 11’ 


Differentiating again and using section 2.2.9, 


d’y τς 2) = at 
dx? ~ ax (ὦ G (oot 1) 


= —}cosec? $f ————_~ 
2 COSeC” 3! 0 -- cost) 


1 
= — — cosect dz. 
4a = 


d? 
᾿, is aia to ee? the procedure for finding = as 


50" Ore The expression = -- © function of ¢) must be calculated as 
‘ (function of 1) τ 
2.2.11 Implicit functions 


These are given in the form f(x, y) = 0 or f(x, y,c) = 0 where c is 
some number which may be a parameter, that is as equations which 
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involve x and y in such a way that y is not given explicitly in terms 
of x, e.g. x? + y? = 1; x8 + 4xy® — 3γ = 2. 

It is possible to calculate dy/dx from such an equation by implicit 
differentiation using the previous rules. 
Example (i) Find dy/dx at any point (x, y) on the curve given by 
x® + 4χγϑ — 3y® = 2. 


Solution Differentiate both sides of this equation with respect to x, let 
D be the operator d/dx 
D(x®) + D(4xy*) — DBy*) = D(2) 


5x* + 4{y° + xDy*} — 3D(y*) = 0 
5x* + 4y? + 4x3y?Dy — 3 x Sy*Dy = 0. 
Hence at points (x, y) where 15y* — 12xy? 4 0, 
dy _Sx*+4y*__ 
dx 15y* — 12xy? 


Example (ii) Find dy/dx if γ᾽ = 2x — 1. 


Solution 
(a) Differentiating both sides of the equation with respect to x, then 
dy 
2 -- = 
3y ae 2 
dy 2 
or dx a 3y? 
(b) Write x = 4y°+4, then 
ce ὩΣ 
dy 2 
Oe te 
dx 3» 


Thus dy/dx can be calculated as the reciprocal of dx/dy provided y is 
a differentiable function of x and x is a differentiable function of y. 


Example (iii) Show that the derivative of x" is nx"~* when ἢ isa rational, 
non-zero fraction. 


Solution If n is rational it is of the form p/q where p and gq are integers 
and g # 0. If p #0 and p,q have no common factor then x is of the 
required form. 
Then y=xt = xP 
and yt = xP 


where p and 4 are non-zero integers. 
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Differentiating with respect to x, 


dj 
gy = px?-* 


or 2 = ) xP-1- @ig@-1) 
= (2) xPiq-1, 
q 
— xh = pyt-1 
Hence a Ἔχ 


2.3 REPEATED DIFFERENTIATION, LEIBNITZ’ 
THEOREM 


2.3.1 
Consider first some examples. 
Example (i) Show that the nth derivative of sin ax is a" sin (ax + ππί2). 
Solution Let f(x) = sin ax 
Sf’) = acos ax 
= asin (ax + ἐπ) 
ie. f(x) = a" sin (ax + ππ|2) is true when ἡ = 1. The method of 
induction is used for the proof. Suppose that for some fixed integer 
value of ἢ, 
ΡΟ) = a" sin (ax + ππ|2) 
Differentiating both sides, 
ft?) = a"*? cos (ax + ππ|2) 
= at) sin (ax + ππ|2 + 77/2) 
= a"*? sin {ax + (n + 1)π|2} 
i.e. the result is true when n is replaced by n + 1. Since it is true when 
Ἢ = | it is true for all integer values of n > 1. 
Example (ii) Find the nth derivative of sin 2x when x = 0. 


Solution D" (sin 2x) = 2" sin (2x + ππ|2) using example above. When 
x = 0, D" (sin 2x) = 2" sin (n7/2) and this has different values accord- 
ing as n is even or odd. 

When ἡ is even, write n = 2k: 


at x = 0, D?* (sin 2x) = 2?" sin kar = 0. 
When n is odd, writen = 2k +1: 
atx = 0, D***?1 (sin 2x) = 2?**1 sin (2k + 1)π|2 


= 22k+1 ς- 1)*. 
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2.3.2 Leibnitz’ theorem 
Let f and g be functions of x which may be differentiated ἢ times. Then 


D(fg) = δε + gf. 
Repeatedly applying the rule for differentiating products and using 
D™ for the operator d"/dx", 


D°(fg) = fD°g + 2DfDg + gD*f 
D°(fg) = fD°g + 3DfD°g + 3D°fDg + gD*f 
D*(fg) = fD*g + 4DfD°g + 6D?fD’g + 4D°fDg + gD*f. 


These coefficients are binomial coefficients and the result 


D™ (fz) = {Dg +. (7) Df D*-1g le eee ( ρα τὴν 
+...+gDS (11) 


for any positive integer ἢ is implied and may be proved by induction. 
It is called Leibnitz’ theorem. 


Example Find D{x? sin x}. 


Solution 
D™{x? sin x} = x?.D™ (sin x) + nD(x?)D*~} (sin x) 
+ dan — 1)D*(x?)D"- Asin x) 
= x? sin (x + 47) + n2x sin {x + ἐξ — 1)πῈ 
+ ἐπι — 1)2 sin {x + Hn — 2)πὶ, 


since D(x?) = 0 when n > 2 and D™ (sin x) = sin (x + 4m) from 
example (i) of section 2.3.1. 


2.4 THE INCREMENT OF A FUNCTION 

Consider an estimate of the change dy produced in a function y = f(x) 
when x changes by a small amount 6x. Let P be the point (x, y) on the 
curve y = f(x) where the function is differentiable, and let Q be the 
neighbouring point (x + dx, y + dy) where 6x ¢ 0. Then the slope of 
the chord PQ is dy/dx which approaches the limiting value dy/dx as 
6x -> 0. Hence the difference between dy/dx and dy/dx is numerically 
small when |dx| is small. Let this difference be denoted by ε, 1.6. 


ὃν ἡ 
ὅς -42-Ξ (12) 
Then the statement 
ὃν di 
Lim 2 = 2 = fon (13) 
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is equivalent to the statement 


Lim δ = 0 (i4 
éz->0 
or ne aw +e where «0 as 6x—>0, (15) 
ὃχ dx 
and multiplying by 6x 
ὃν = ᾿ ὃχ + εὃχ (16) 


Although equation (16) was derived under the assumption that 
dx # 0, it is still true even when 6x = 0, since it is only necessary to 
define ε = 0 when 6x = 0. 


Fig. 2.3 


Consider geometrically the graph of y = f(x) as sketched in Fig. 2.3. 
The first term on the right hand side of equation (16) is represented by 
the length TR, since the slope of the tangent PT is dy/dx, and this is 
the change produced in y along the tangent at P. When ὃχ is small, 
this term is usually large compared with the second term εὖχ. The 

d 
difference between QR = = 6x + edx and TR = 2 6x is εὃχ which 
tends to zero more rapidly than 6x does when 6x ΦΗΣΙΝ zero. 

For example, if y = f(x) = x?, dy/dx = 2x = f'(x) and 


dy 
x 6x = 2x dx. 


On the other hand the exact value of dy is given by 
ὃν = 2x dx + (6x)? = f(x + 6x) — f(x). 
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Comparing this with equation (16), for this particular example, 
εὸὃχ = (6x)?. When 6x is small, (6x)? is smaller than 6x, and dy is 
approximately equal to 2x dx. This example may be illustrated geo- 
metrically as in Fig. 2.4. 


μ- x wl δα. >| 


Fig. 2.4 


The original square has side of length x and hence area y = x”. When 
the sides are increased to x + 6x the area is increased by dy = 
2x dx + (6x). When dx is small compared with x most of the change 
in y is given by the two rectangular areas and only a very small part is 
given by che small square of area (6x)?. 

Hence if the original square has side 2 cm say and the larger square 
side 201 cm, then x = 2, x + 6x = 2-01, dx = 0-01. The area of the 
original square is y = x? = 4 and the area of the larger square is 


y + oy = (x + dx)? 
= x? + 2xdx + (6x)? 
= 4+ 0-04 + 0-0001. 


The increment dy = 0-04 + 0-0001. Hence if 2χὸχ = 0-04 is used as 
an approximation to dy there is an error of 0-0001 square centimetres 
which is just under 4% of dy. 

Since the term (dy/dx)dx usually gives a good approximation to dy 
when 6x is small compared with dy/dx, it is customary to use the 
approximation 


ty = 8x (17) 


or Of = f'(Xo)dx (18) 
in numerical calculations. 
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Example Using the approximation in equation (17) determine a reason- 
able approximation to (2-98)*. 


Solution Let y = χϑ and start from the point x = 3, y = 27, 
χα dx = 2:98, yt dy = (2198), 


Using equation (17) to find an approximate answer, with 6x = — 0-02 
at x = 3, Oy = (dy/dx)dx = 3x?dx 
= —0:54 
hence (2:98)? = y + dy = y + (dy/dx)dx 
= 27 — 0-54 
or (2:98)? = 26-46. 


2.4.1 The differential 
Equation (16) may be written 


Of = f'(X)dx + εδχ. (19) 
If Οὐ τ 0, the differential df at x = Xp is defined by 

df = f'(Xo)6x (20) 
and is represented by TR in Fig. 2.3. The differential of the function x 
" dx = ὃχ (21) 
so that equation (20) may be written 

df = f'(xXo)dx 
or dy = f'(Xo)dx. (22) 


Thus the differential dy at x = x, is the amount of change in y along 
the tangent line at the point x) which would be produced by a change 
dx in x; dy/dx is the rate of change of y per unit change in x. 


2.5 APPLICATIONS OF DIFFERENTIATION TO CURVE 
SKETCHING 
2.5.1 The sign of 7΄ (6) 
Let y = f(x) be continuous and differentiable in the range a <x - ὃ. 
If f(x) increases monotonically in this range, for any value of x in 
a<x<b,[f(x + h) —f(x)I/h has the sign of f’(x) provided h is small 
enough, and hence f’(x) > 0 in the range. 
Similarly if f(x) decreases monotonically, then f(x) < Oina<x <b. 
See section 1.1.4 and Figs. 2.5(a) and 2.5(b). 


2.5.2 Concavity 

Let f’(x) and f(x) both exist in the range a <x <b. The graph of 
y = f(x) is said to be concave downwards if f’(x) decreases, i.e. f"(x) < 0 
as x increases through the range. The graph of y = f(x) is concave 
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upwards if f’(x) increases, i.e. f"(x) > 0 as x increases throughout the 
range. See Figs. 2.6 and 2.7. 


(a) (0) 


Fig. 2.5 
y y 
Ι \ 
! I 
Ἰ ! 
| I 
| I 
‘ ' 
' I 
᾿ . 
f'(ac) decreasing f(x) decreasing \ 
' “ 
Γ f(x)<O ἢ f(x) <0 Ι 
Ι ! ! 
a Le aE ee τὸς τ τορος, 
Οἱ α δ x ie) a b x 
(a) (b) 


Fig. 2.6 Graphs of y = f(x) concave downwards ina <x <6 


4 


ΤΕ (ac) increasing 
'f"(ac) >O Ι 


f’(2c) increasing 
f(x) >O ! 


(a) (b) 
Fig. 2.7 Graphs of y = f(x) concave upwards ina <x <b 


When the graph is concave downwards the tangent at each point is 


‘above’ the curve, and when the graph is concave upwards the tangent 
is ‘below’ the curve. 
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2.5.3 Maxima, minima 

A maximum value of a continuous function is one which is greater 
than the values in the immediate neighbourhood. A minimum value is 
one which is less than those in the immediate neighbourhood. Thus the 
maximum and minimum values of a function refer to local properties 
and it is possible for a maximum value to be less than a minimum 
value. See for example, Fig. 2.8 where the maximum value at B is less 


y 


Fig. 2.8 


than the minimum at A. Moreover, maximum and minimum values are 
not necessarily the greatest and least values of the function in any given 
range. For example, in Fig. 2.8, the greatest and least values occur at 
the beginning and end of the range. 

A point where ἃ maximum or minimum occurs is called a turning 
point. 


2.5.4 Conditions satisfied at a turning point 

Suppose f(x) is differentiable in a < x < band that there is a maximum 
or minimum value of f(x) at the point x = Xo. Then if f’(xo) 4 0, > 0, 
{f(Xo + h) — f(%o)}/h can assume both positive and negative values in 
the neighbourhood of x, provided h is small enough. Hence at x = Χο 
the function is not differentiable. But f(x) is differentiable at x = Xo 
and the assumption f’(xo) # 0 is false. Hence f’(xo) = 0 is a necessary 
condition for a turning point, and these are located by solving the 
equation f’(x) = 0, i.e. by finding those points on the graph of y = f(x) 
where the tangent is parallel to the x-axis. 

This condition is not sufficient to discriminate between a maximum 
and a minimum value. If there exists x) such that f’(xo) = Ὁ then 
usually f(x) changes sign as x increases through the value x. If f’(x) 
changes sign from positive to negative then f(xo) is a maximum value, 
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and if f’(x) changes sign from negative to positive then f(x») is a 
minimum value of the function. See Fig. 2.9. 

The use of the sign of f’(x) to the immediate left and immediate right 
of a root of f’(x) = 0 is an infallible method of distinguishing between 
maximum and minimum. 


y = f(x) 


f’(x)<O 


ie) Xo 


(a) Maximum at x = 29 (0) Minimum at x= x9 


Fig. 2.9 


Alternatively the sign of f’(xo) may be investigated in order to 
discriminate between maximum and minimum. At a maximum turning 
point the curve is concave downwards, i.e. f(x) decreases as x increases 
through the value xo, or f’(x9) < 0. Similarly at a minimum turning 
value the graph is concave upwards, f’(x) increases as x increases 
through the value xo, i.e. f"(xo) > 0. 


2.5.5 Conditions for a point of inflexion 


Suppose f’(x.) = 0, then usually f"(x) changes sign as x increases 
through xp. If f"(x) changes sign from Positive to negative, x = χρ Is 
a transition between f’(x) > 0 (i.e. f(x) increasing, f(x) concave 
upwards, and tangent ‘below’ the curve) and f"(x) «0 (ie. f(x) 
decreasing, f(x) concave downwards, and tangent ‘above’ the curve). 
Such a point of transition is called a point of inflexion and the tangent 
there which ‘crosses’ the curve is called the inflexional tangent. 

Similarly if f’(x) changes sign from negative to positive as x increases 
through xo, then (xo,f(xo)) is a point where the graph of y = f(x) changes 
from being concave downwards to concave upwards and there is a 
point of inflexion at x = Xp. 


2.5.6 Summary 

Let y = f(x), then 

(i) Conditions for a turning point at x = xo are f’(xo) = 0 and f’(x) 
changes sign as x increases through Xo, 
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(a) f(x) changes from positive to negative (or f’(xo) <0) gives a 
maximum at x = Xp; 

(b) f’(x) changes from negative to positive (or f”"(x%o) > 0) gives a 
minimum at x = Xp. 


(ii) Conditions for a point of inflexion at x = χρ are f’(xp) = 0 and 
7΄Οὐ changes sign as x increases through Xo. If f’(xo) = 0 and f"(x) 
does not change sign as x increases through Xo, it is necessary to con- 
sider the sign of f’(x) to the left and to the right of xo. 

Points at which f’(x) = 0 are called stationary points, which may or 
may not be turning points. 


Example (i) Investigate the stationary points of f(x) = x°. 


Solution f(x) = x*, f(x) = 3x?, f’(x) = 6x. Now, f(x) =0 when 
x = 0, f(x) > 0 for all values of x so that there is no change of sign 
of f’(x) and hence neither maximum nor minimum. 

Also f"(x) = 0 when x = 0, and f"(x) changes sign from negative 
to positive as x increases through the value x = 0. Hence the stationary 
point (0, 0) is a point of inflexion, and the inflexional tangent is the 
line y = 0. See Fig. 1.8 for the graph of y = χϑ, 


Example (ii) Investigate the stationary points of f(x) = x*. 


Solution f(x) = x*, f(x) = 4x, f"(x) = 12x°. Now f’(x) = 0 when 

= 0 and f’(0) = 0. But when x = 0 there is a minimum value of 
J(x), since f’(x) changes sign from negative to positive as x increases 
through the value zero. 


Example (iii) Investigate the inflexion point on the curve y = sin x, 
--ΞῬ}δπ - χ -ζ 5π. 


Solution y = sin x, γ' = cos x, γ΄ = —sin x. Now γ' = 0 when cos x = 
0 which has no solution in the given range. Also y” = 0 when sin x = 0 
and x = 0 is the only value in the range. Moreover y” changes sign 
from positive to negative as x increases through the value x = 0. Hence 
(0, 0) is a point of inflexion. The slope of the inflexional tangent is 1, 
since y'(0) = 1, and its equation is y = x. See Fig. 1.17(a). 


2.6 DIFFERENTIATION OF INVERSE FUNCTIONS 

Let y=/(x) be differentiable and monotonically increasing in 
a<x <b. Then the inverse function x = ¢(y) can be shown to be 
differentiable for all value of y between f(a) and f(), provided that 
70) #9. 

In Fig. 2.10, let P be the point (x1, f(x), then x, = @(y1) and P can 
be considered as the point (¢(y1), γι) on the graph of x = $(y). In effect 
the same graph holds for both functions. If f’(x) # 0, the tangent at P 
to y = f(x) is not parallel to the x-axis, which also means that the 
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tangent at P to x = φ() is not perpendicular to the y-axis, and #(y) 
is differentiable at P. 

Let tan p be the gradient of the tangent at P so that f’(x,) = tan y. 
Let 6 be the angle made by the tangent at P to the y-axis so that 
¢ ()1) = tan θ. Since 6 = ἐπ — y, tan 9 = I/tan y, or 


ae oe 
dx (dx/dy) 


Fig. 2.10 


2.6.1 Differentiation of the inverse circular functions 


(i) The inverse function y = sin~? x means 


x = sin y and —in<y<in 
and hence, 
dx 
ay = cos y 
= -ε νῷ — sin? y) 
= Ξεὀ (1 — x”). 
In the range --π < y < ἐπ, cos γ is positive so that 
Ot ee 
ἀχ V(i — x?) 
or a sin? x = as |x] <1. 


(ii) The inverse function y = cos-+ x means 


x=cosy and O<y<z, 
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nd ep at 
a ae —sin y 
= FV — cos? y) 
= Ἐν -- 24). 
In the range 0 < y <7, sin y is positive, dx/dy is negative and 
a -ly— -ὶ 1 
7008 χ = Vans) |x| < 
(ili) The inverse function y = tan-1 x means 
x = tany and —kr<y Ξ ἐπ, 
ἐχ 
and ΠΩ sec? y 
= 1+ tan? y 
=1+ x? 
d 1 
ane l ye SS 
and 7 tan x= 7 $x 


Example (i) Show that arecos x + arcsin x = ἐπ. 
Solution Let f(x) = arccos x + arcsin x, 
, = 1 
then POST τὰ πε We ey Ὁ 
Hence f(x) = c for all values of x, where c is some constant. Writing 


x = 0, 
{0 =37+0=¢, 
1.6, arccos x + arcsin x = ἐπ. 
Example (ii) Differentiate (a) arcsin x?, (Ὁ) arcsin (cos x). 


Solution 


1 
va — yy * 
- 2x 
= Ya —x) 


d 
ee 1 2) ... 
(a) Ὡς {arcsin x?} = 


d : d ; 
(b) ἧς {arcsin (cos x)} = oe {arcsin t} where ἢ = cos x, 


=e in 1 τὰ 
=F resin ἢ) ἘΞ 


ae. dt 
= Vd — 4 * ax 
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_ —sin x 
~ a/(1 — cos? x) 


_f-1 if sinx>0 
~ | 1 if sinx <0. 


2.7 THE PARTIAL DERIVATIVE 


The derivative of a function of two variables with respect to either is 
called the partial derivative. Suppose w = f(x, y) is a function of the 
two independent variables x and y defined over the region R in the 
(xy) plane and consider the expression 


70 ὁ + ἢ, yo) — f(x Yo) 
h 


If this tends to a finite limit as h —> 0, its value is denoted by f2(%o; Yo). 

It may be called the rate of change of f(x, y) with respect to x when y 

is held constant at the point (xo, yo) in Δ. The function F(x, y) is called 

the partial derivative of f(x, y) with respect to the variable x, ie. 
Lim + 49) -- fey) 
"50 h 


= f(x, y) 
of 


when this limit exists. Other notations are 1) 


2) 
or () ἴο 
᾿ : of 5 x y constant ox ν᾿ 
emphasize y is fixed, ὃν and fi(x, γ) this last meaning the partial 


derivative of f with respect to the first variable mentioned. If no am- 
biguity can arise, the notation @w/dx where ὦ = S(x,y) is also used. 
Similarly 


of f\ .. fx y+kh -- 7, y) 
tian α Ho (B) pest 


is the partial derivative of f with respect to y and may be regarded as 
the rate of change of f with respect to y when x is considered to be held 
fixed. Then the ordinary rules of differentiation apply. 


ὃ 
Example (i) Ox sin (2x? — y) = 4x cos (2x? — y) 


a, 
ὃγ sin (2x? — y) = —cos (2x? -- γ). 
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Example (ii) x = r cos 6, r and 6 being independent variables 


x : 
δ, = COS 8 (6 considered constant), 


Ox ᾿ : 
= sin 6 (r considered constant). 


Suppose also that y = r sin 0, then 


In this example, r = +/(x? + y?) (x, y independent) so that 


8 
= = ea (y considered constant) 


ox 1 

or 7 (rjox) 

In calculating 0x/ér, x is a function of the independent variables r and 9 
and 6 is considered as held fixed; in calculating 0r/@x, r is a function 
of the independent variables x and y and y is considered fixed. Thus 


(@x/@r) and (@r/@x), are not necessarily reciprocals. 
Note however, that from x = r cos θ if 0 is held fixed, 


Ox or 1 
(5), =cos@ and (Z), aah 


(=) 6 ~ ΤΣ 


1.6. 


so that 


2.7.1 Higher derivatives 


Since f, and f, may themselves be functions of x and y they may under 
certain circumstances be differentiated partially with respect to x and y. 


ὃ (ὃ 
If of exists as a limit it is denoted by 
ox \Ox 


232 
fabs y) or FE or fixlsy) 


Similarly 


5 (2) = or fy{x,y) OF foolx, y) 
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Also 


a 83 
Ὁ “11 =a Or ful%y) oF farl, y) 


ὃ 82 
ox = (2) - 5 or fry(x,y) or fialx, y) 


These last two are often called ‘mixed’ derivatives. 


2.7.2 Extension to functions of three or more variables 


The definition of the partial derivative of a function of two variables is 
extended to a function of three or more variables. For, let V(x, y, 2) 
be a function of the independent variables x, y, and z, then 


av . Vxth,y, 2) — V(x, y, 2) 
=o = Lim —— eee 
Ox hwo 


ie. V is differentiated partially with respect to x; y, and z being con- 
sidered constant. This may be written 


( Ἴ ( Ἴ 
ox y,2 constant ox Y.z 


Similarly if V(x1, x2, X3,. . ., X,) is a function of the independent 
variables x1, Χο, X3,. . +) Xny 


ov . Vox + A, χα, χα, ον. Xn) — VOX, Χρ, Xa, s «0s Xn) 
= im oi ee “' 
aX, aro h 


ie. V is differentiated partially with respect to x,, the other n — 1 
variables being held constant. 


Example (i) f(x, y) = x? sin (2x? — y) 
ὃ 
δ = 2x sin (2x? -- y) + χϑάχ cos (2x? -- y) 


ὃ 

- —x? cos (2x? — y) 

23 ὃ (ὃ 

a -z (Z| = 2 sin (2x? — y) + 2x4x cos (2x? — y) 


+ 12x? cos (2x? — y) — 4x°4x sin (2x? — y) 
= . — 16x*) sin (2x? — y) + 20x? cos (2x? ~ y) 


oF a = (Z) = —x? sin (2x? — y) 


oy? dy \a 

o? ὃ (ὃ 

oa τίς) = —2x cos (2x? — y) + 4χ sin (2x? — y) 
22 = z 
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In this example the mixed derivatives f,, and f,, are equal, which 
suggests that the operations 0/0x and 0/éy are commutative. This is not 
always true, but it is possible to prove that when f,, and ἔν are both 
continuous they are equal. 


Example (ii) If z is a function of x and y determined by the equation 


Ζῇ — 3yz — 3x = 0, 


show that 
az Oz 832 éz\? 832 
(a) 2x Oy and (b) ΠΕ t (=) | πον 
(Assume z? — y # 0.) 
Solution 28 — 3yz — 3x = 0 (i) 
(a) Differentiate (i) partially with respect to x to give 
az az 
2 -- -- = 
Ἔ ox Ἵ ox : 
. oz 1 
1.6. ax 2 --ν 


and with respect to y to give 


az az 
Fas aac ee = 
3z ay 3y ay 3z 
oz 
os dy Ζ΄--ν 
dz 062 5 
hence 25. = ay (ii) 


(b) Differentiate equation (ii) partially with respect to y 
az Oz az Oz 
dy ox ' 7 dy ox dy? 
substitute for z/@y from equation (ii) and 
éz\? az az 
7\ax) + 7 dyax ὃν 


which gives the required result when @z/0y 0x is replaced by 07z/0x dy. 
Notice that ordinary rules of differentiation of constants, of products, 
and of function of a function are used. 
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EXERCISE 2 
1 Find f’(1), £0) for the function f(x) = x? + I/(L + x?). 
2 Find ΚΓ), Κ({πὴ for the function sin 2x + cos * x. 


3 Differentiate 
. i+tanx Sea Gass! ten 
(i) Toa Gi) νί + sin? x); (iii) sin? x cot 2x. 

4 Show that the gradient of the curve y = x? + 6x? + 15x + 28 is 
positive for all values of x. Show that the curve has a point of 
inflexion when x = —2 and state the gradient of the curve at this 
point. Sketch the graph of y = x3 + 6x? + 15x + 28. 


5 Find dy/dx as a function of ὦ if x = t+ sint and y = 1 — cost. 
Show that d?y/dx? = 1/(1 + cos r)?. 


6 Find dy/dx in terms of ¢ when 


_il-t _1—2t 
or ae Οὐ as 
d?y 2 (1 + 2t\8 

and show that «ἀ- -Ξ (1 3}᾿ 


7 Find dy/dx at the point (2, 1) on the curve given by 
x® — 3xy + γ" = 3. 
8 Find the equation of the tangent at the point (1, 0) on the curve 
3x3 + 2xy? = 2y + 3. 


9 Find the coordinates of the points on the curve x? — xy + y? = 12 
at which the gradient is zero. 


10 Show that the mth derivative of cos x is cos (x + 477), and find the 
nth derivative of (x + 1) cos 3x when x = 0. 

11 Find the coordinates of the turning points on the curve 
y=x+1—2cosx in the range 0<x<2z, distinguishing 
between the maximum and the minimum. Show that the tangent to 


the curve where x = 0 is also the tangent at the point where x = 27. 
Draw a sketch of the curve in the range 0 < x < 27. 


12 Find the values of f,, f,, fez, and ὅν, and verify that f,, = fy. for 
the following functions. 


(i) x/y; (ii) (x? + γ)η; (iii) x cos y + y cos x. 
In (i) and (ii) assume x 4 0, y #0. 
13 Show that, if w = 1/r" and r? = x? + y? + 2? then 


Ou μη ὃὲμ n(n — 1) 
aa teat ga pare ΓΤ 0. 
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14 If u and v are functions of the independent variables x and y which 
satisfy the equations μῆ — v? + 2x + 3y =O andw+x—y=0 
and u #0, v #0, find @u/dx, dujéy, dv/dx and dv/éy in terms of u 
and v. 

15 Show that if V(x, y, 2) = x/(x? + y? + 225 

ΟΥ OV ov 
ἘΣ 2 2 2 
then x > ey Gye = 2}, x? + y? + 22 £0, 
16 If V = x"f(Q) where n is a constant and 


eV 29.» 1 a@V 


oe tx ox + Re? 


show that 
τὰ 


=0, (x0). 


3 
Infinite Series and Convergence 


3.1 LIMIT OF A SEQUENCE 


Consider an infinite sequence of numbers denoted by {a,} or a1, do, ag, 
«4 Gy). . . Where 7 is a positive integer. The value of any term depends 
on its position in the sequence, and there is no last term. 


1 
Example (i) 0, 4, % 3%. . slo. ἐν ὩΣ 1. 


Example (ii) 1, 14, 13... . ..2—1/27,.. .,n >0. 
Example (iti) 1,4,4,.. ., 1/2... ..n 0. 
Example (iv) y, y*, 3,...59%5..4n 21. 


Example (v) 1, —2, 3, —4,...,(—1)"*?1,.. 7 


WV 


Fig. 3.1 


Graphically each of these examples may be represented as points on 
a straight line, or as functions of n by a ‘graph’ which is a set of isolated 
points defined for integer values of n only. Figure 3.1 shows the ‘graph’ 
of {y"} when y = —} and Fig. 3.2 shows the graph of the sequence 
{1 — I/n}. 


48 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


A sequence approaches a limit / if when any positive number, say h, 
however small, is chosen, then eventually all the terms of the sequence 
lie in the range ] — ἡ to] + ἢ, 1.6. a, >1 as n— © if it is possible to 
choose A so that all the terms of the sequence after the Nth, lie in the 
range / — ἢ to / + h, where N depends on the value of ἢ chosen. Such 
a sequence is illustrated in Fig. 3.3. 


xX } 1 1 iad Exeteoea | 1 Ἐξτ > 
oO 4 2] 3 4 5 6 7 8 9 10 n 
Fig. 3.2 
x 
ἼΣΟΥ ΡΣ ee rn eA ene ee Cee 
Χ 
ἡ Bats ome he cha ne τος σοι πε a ἐδιθιίοςς XD Eph Dee ais Kt oe. 
x x x 
Χ Χ 
ΟΞ ΓΞ μιὰ χύσις SS Sa = apiece fase 
x 
x 
rn 1 i 4 L mt 4. a LL. ᾿ ι 
Ὁ Ν Ω͂ 
Fig. 3.3 


For example, consider the sequence a, = 2 — 1/2", choose ἢ == 1/10°. 
All the terms of the sequence after the tenth are within the range 
2 — 1108 to 2 + 1/10° and hence the limit is 2. 

In the examples above, sequence (i) has limit 1, sequence (ii) has 
limit 2, sequence (iii) has limit 0. The behaviour of sequence (iv) 
depends on the value of y; it will be shown in section 3.3.1 that the 
limit is 0 if —1 < y <1, but that if y is outside this range there is no 
limit. Sequence (v) has no limit. 

A sequence of positive terms {a,} is said to tend to infinity if for 
every choice of a number A, however large, a, is greater than A for all 
large values of n. This is written a, > 00 asm — οὐ. Similarly a sequence 
of negative terms a,—> —0 as n-» © if it is possible to choose a 
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number, say —B where B is any positive number, such that all the 
terms of the sequence beyond a certain point are less than —B. A 
sequence which does not tend to a limit or to + is an oscillating 
sequence. For example, if a, = γ᾽ and y = —I, the sequence —I, 1, 
—1,1, —-1,.. . is an oscillating sequence. 

A sequence is monotonic increasing if a,,; > a, for all values of n. 
Similarly a sequence is monotonic decreasing if ¢,,; <a, for all 
values of n. 

A sequence is bounded above if there is some number C, such that 
ad, <C for all values of n, and it is bounded below if there is some 
number £, such that a, > E for all n. Sequence (a), 

1 


0, , ξιν νι aa, eee 


is monotonic increasing and is bounded below by 0 and above by 1. 
Sequence (v), {(—1)"*2n} is bounded neither above nor below. 

The most fundamental theorem on monotonic sequences is the 
following, which is stated without proof. An increasing sequence which 
is bounded above tends to a limit; a decreasing sequence bounded 
below tends to a limit. An increasing sequence which is not bounded 
above tends to infinity as n tends to infinity, a decreasing sequence 
which is not bounded below tends to minus infinity as 7 tends to infinity. 

Suppose / and /’ are the limits of the sequences {a,}and {b,}respectively. 
Then a, = /+k, and δ, =l' + Κ΄, where n depends on the values of 
kns Kn’ chosen. It is not difficult to show that λὶ + yl’ is the limit of the 
sequence (λας + ub,}, where 4, u are any constants. For it is possible 
to choose a positive number ἢ, however small, so that for sufficiently 
large n, |Ak,| < ἐπ and |wk’,| < 3h, and hence 


|Ad, + ub, — (Al + μ|Ὑ] = [λκ, + μκ',} « hh. 
that is, Ran + μδ, > Al + wl! as n> 


In particular, if A= 1, “= —1 then a, — 5,1 -- Π that is, the 
limit of the sequence formed from the difference of two sequences is 
the difference of the limits. 


Example (i) Examine the sequence {a,} = {1 — 1/n}. 


Solution Since n+1>Aa, 
Sens 
n+l = n 
1 1 
and 1 -- ἡ: Ὁ 1— x 
then Angi > a, allan. 


Hence the sequence is monotonic increasing. Also a, < 1 for all n 
and so the sequence is bounded above and tends to a limit as m > οὐ. 
z<Moreover lim a, = 1. 


n> oe 
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Example (ii) Examine a, = 1/2". 


Solution Here, ay41 < dq, Since 1/2"*1 < 1/2", all n, and a, > Ὁ for 
all n. That is, the sequence {a,} is decreasing, bounded below, and 
hence has a limit. The limit is zero. 


Example (iii) Examine a, = n. 


Solution Here 4,41 > a,, the sequence is monotonic increasing 
bounded below by zero and is not bounded above. Hence a, —> 00 as 
n— oo, 


Example (iv) Examine the sequence {a,} = {(1 + 1/n)*}. 


Solution The general term is 


= n an — Ὁ 1 nin — \\(n — 2) 1 1 
cae ae ἘΣ 5 5 1.2.3 gs ὙΠ πε 


1 1 1 1 2 
=141475 (1-5) +pg5 (1-3) (1-Z) +. 
1 1 2 n—1 
+ orga (t-3)(1-9)-- Ὁ) 
Consider any value p > n, then 


1 1 1 1 2 
amitiers(t—3)+raa (1-5) (1 -- Ξε... 


reer ee) 


Now a, > a, since ad, consists of more terms than a, which are all 
positive, and in those which correspond, those in a, are the greater, for 


example: 
1 1 1 1 
τ 29] 


: 1 1 1 1 
ἘΠΕ ΕΣ ap po τας . 


—_— 


Therefore {a,} is an increasing sequence. Moreover the sequence is 
bounded above since 


1 1 Ι 
ἄς ΘΑ TEE Pa ΤΥ ΕΣ τ 


Τ᾿ Ἢ 1 
«ΤῈΤΉ ΣΈ ΡΣ Ἐ- -. +o 


= 14 1420-0 


< 3 for all values of 7. 
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Hence a, approaches a limit since it is an increasing sequence bounded 
above. This limit is denoted by the constant e. 


3.2 CONVERGENCE OF SERIES 


3.2.1 Sum of a series . 
A series is the sum of terms of a sequence. The series δ᾽ a, having a 


1 
finite number of terms is called ἃ finite series and has a finite sum. The 
00. 
series >. a, having an infinite number of terms is called an infinite series 
1 


and may or may not have a finite ‘sum’. 
The sum of the first 2 terms of a series is denoted by s, and is called 
the nth partial sum. Thus s; = a, Sg = αι + ag, Sg = Gy + Ag + Gs, 
κω, Sp =A, + dg +ag+...+4,. For example, consider the 
infinite series 


1 1 1 


Ι 
P+5+gtgt- + tonat-- + 


then, δι = 1, Sg = 1h, 585 = 13,. . 5 5, = 2-1. 


These partial sums themselves form a new sequence which has the 
limit S = 2 as n > oo, and this limit of partial sums is called the ‘sum 
to infinity’, or just the sum of the infinite series. This is written 


1+4+@QP?+...+G@"+...=2. 


The ‘equals’ sign is used here to mean the limit of the partial sums. It 
does not mean that a large number of terms of the series may be added 
to give the sum 2, for there is no value of n such that the sum of n 
terms is 2, nor is 2 a member of the series. 

When the sequence of partial sums {s,} tends to a limit S as n > ©, 
the series is said to be convergent, or to converge and the limit S is 
called the sum of the series. When the sequence of partial sums does not 
converge, the series is divergent, that is if s,—> 0 or s,-» τοῦ or if 
S, oscillates. 


3.2.2 Conditional and absolute convergence 
When the series 
Q4,+dgt+adg+...ta,t+... (1) 


consists of some positive and some negative terms it may be conditionally 
convergent or absolutely convergent. 
If the series 


Ια} + [αι] + las} +... +a.) +... (2) 


converges then, as shown below, so also does the series (1), and (1) 
is then said to be absolutely convergent. 
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If series (1) converges and series (2) does not converge, then (1) is 
said to be conditionally convergent. For example the series 


Ι-π-ἐπὲπ-:. 


is conditionally convergent, since it converges (see section 3.5.3), yet 
the series 
L+et+et tHe. 


may be shown to be divergent (see section 3.3.2). The series 
1 1 1 
1- 58 + yp 42 +... 
is absolutely convergent since the series 
1 1 1 
1+ 55 + 32 Ἔχε --- 


may be shown to be convergent. See example (iii) of section 3.5.1. 
To show that if (2) converges then so does (1), write u, = 4(|a,] + an) 
τες = ξ([α,] — a,). When a, > 0,u, = ὦ, and v, = 0; when a, « 0, 
= 0 and v, = —a,, so that whatever the sign of a,, u, τὰ 0 


and v, > 0. Also |a,| = uy + Up, dy = Un — Ups Un < |a,|, and > [ας] 
is eonyereent 


Suppose Σ [an = A, then >u, <A for all values of n. Hence the 
partial sums Su, tend toa limit, ie. Su i is convergent. Similarly 3», 
is convergent. "Using the result that the limit of a Sequence {u, — ἢ 


is the difference of the limits, leads to the result that Ya, = Seen — U,) 
is convergent. 


3.3 SOME STANDARD INFINITE SERIES 
3.3.1 The geometric series 
ae eee ee = dy 
The nth partial sum, s, is given by 
S=lty+ty?+...+y™? 


Ι -- γῆ 

l—y 
1 ΠΕΡ 
as a ET a 


When —1 <y <1, then y*—0 as n> οὐ and s,— 1/(1 — y), so 
that, by the definition of convergence of series in section 3.2.1] the 


geometric series > y” is convergent and has the sum 1/(1 — y). 
n=0 
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If y< —1 or y > 1, y" does not tend to a limit as 1 --» οὐ and hence 
neither does s,. 
If y = 1 the series is 1 +1+1+. ἡ and δὲ τὸ πττ οὐ withion: 


Hence, if y < —1 or y > 1 the geometric series Σ y” is divergent, 


if —1 < y < 1 it is convergent. 


3.3.2 The harmonic series 
The series 


1+h4h+E+...= 5 Im 


is sometimes called the harmonic series. This is divergent, for although 
a, = 1/n—0 as π-- οὐ, it is possible to prove s, increases beyond all 
limit. The sum of the first four terms 


L+et+h+d>l+e+et+eal+ htt 
The sum of the next four terms 
P+htht host etateat=d 
The sum of the next eight terms is > 8/16 = } and so on. Hence 
Sb ett Pee 


to k terms where n = 2*-! and increases beyond all limit as nm — oo. 
Hence the series is divergent. 


3.3.3 The series Σ I/n(n + Ὁ 


The series = 1/n(a + 1) is convergent and its sum can be found. The 


nth partial : sum is given by 


1 1 1 1 
ame Oy i Wr 3 eS ἢ 
Ἔ ΘΠ. 1 
and since Ane τὶ 
1 1 1 1 | 1 
MER: eS a aa ek en | 
1 
pores ee ἢ 
and 5, “1 as n->o. 


Hence the series converges and its sum is I. 
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3.4 SOME RULES 


3.4.1 Rule 1 
Let αι +a+a,+... (3) 
be convergent and its sum be S. Then Lim a, = 0 since a, = 5, — ὅπ.-.1 


n-> @ 


and s, and s,_, have the same limit S as n > οὐ, 
Lima, = S—S=0. 


nu—->o 
This property must not be taken as a test of convergence. If a series 
converges it is necessary that the nth term tends to zero, and if the nth 
term does not tend to zero there is no need to test further for con- 
vergence. That is, the converse of this rule is a test of divergence. The 


series Σ a, is divergent if 4, d does not tend to zero as ἡ tends to infinity. 
For example, the series D3 1/n was shown to be divergent in section 


=1 
3.3.2, yet a, = In +0 as n-> ©. The series 


~ a 3 
Saain73titat 
diverges since 
n 1 
ἄρ τα "ΠΕ i+ 1. as no 


3.4.2 Rule 2 
Consider the remainder R,,, after m terms of the convergent series (3). 
This is defined to be the remainder when the first m terms are removed 
and is 
Rn = > An = Ansi + Anse t+ Amiga tb... (4) 
n=mt+1 

The series (4) converges for all integer values of m and its sum is 
S — διι. This means that a finite number of terms at the beginning of 
a series may be removed without altering the convergence of the 
series. The proof depends on the fact that the nth partial sum of the 
series (4), plus the terms removed, is a partial sum of the original 
series (3), which has the limit S. Hence the limit of the nth partial sum 
of (4) is S ~ 5, and hence (4) is convergent. 

Similarly if a finite number of terms is added to the convergent series 
(3) the resulting series is convergent and its sum is S plus the sum of 
the terms added. 


As a corollary, if > a, is convergent, 
Lim R, = Lim (p41 + ἄπ. + Gme3 +...) =O. 


m— 2 mao 


since R,, = S — s, which > 0 asm— οὐ. 
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3.5 TESTS FOR CONVERGENCE 


In order to prove that a series converges it is sometimes possible to 
find its sum. This is not always possible and tests have been devised to 
discover whether a given series is convergent or not. 


3.5.1 The comparison test 


a οο 
Let > a, and > δ, be two series of positive terms. If the second series 
is convergent and a, < cb, for all values of m where c is any constant, 
then the first series is also convergent. For if 


bh δώ ἘΠ 4... .Ξ 8, 
then b,+6,4+ 6, +...+6, <B for all n, 
and a+ agtaz3t+...+a,<cB. 


οο 
Hence the partial sums of the first series tend to a limit and > a, is 
convergent. 


If the series > ὁ, is divergent and a, > cd, for all values of n andc 


is a constant, then > a, also divezges. 
Example (i) Prove > 1/+/n is divergent. 
n=l 
Solution Write a, = 1/+/n, choose δ, = 1/n, then since /n <n for 
1 1 Bing τὸ 
Th >> or a, > δ, for all n. > b, is known to be 


divergent and hence > a, is also divergent. 


all values of ἡ, 


Example (11) Prove that 1 is divergent when p < 1. 
ip 3 8 P 


Solution Let a, = 1/n”, p <1 and choose ὃ, = I/n, then since n? <n 
for all values of n when p <1, a, > δ, for all values of n. But > 5, 


caer: are ἜΝ Care 
is divergent and hence > a, is divergent, 1.6. Σ᾽ — is divergent when 
p<. μι 


Example (iii) Prove that > 1/n? is convergent. 
n=1 


Solution Write a, = GED and choose 6b, = 


(n + 1)? > n(n + 1) for all values of 7, then 


1 ᾿ 
n(n + 1) Since 
ΒΞ: σοῦ eS ae 
(n+l)? <nn+ily 
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all values of n. It is known that Dees) is convergent (see section 


1 τ n(n + 1) 
3.3.3), hence > G+) is eae Now the given series can be 


written 


~ | 
Soq14+5 at -1+ Saar 


n=1 


Hence, using Rule 2 of section 3.4.2, > 1/n? is also convergent. 
1 


Example (iv) Prove that the series δ᾽ 1/n” is convergent if p > 2 


n=1 


Solution Let a, = 1/n?, p > 2, and choose ὦ, = 1/n?. Since 
2n’,pp2 


1 
ae 


then 5» >2 andalln. 


Hence, since > 1/n? is convergent, so also is > L/n? when p > 


It is also possible to prove that Σ 1/n? is convergent when 1 < p < 2, 
using a test for convergence which is not included in this book. The 


series Σ᾽ 1/n?, which is convergent for p > 1 and divergent for p < 1 


n=] 
is used as a standard series in the comparison test. The choice of 
standard series in this test is made from common sense and experience. 
For example, if a, = (2n — 1)/(n* + 1), a, is of the order of 1/n’°, 


οο 
since when n is large, a, = 2n/n* = 2/n°, and the series > a, is compared 
with the convergent series > 1/n°, as follows: 
2n—1 = 2/n®? — 1/n* 


> 1/n® converges, and hence so also does > ay. 
3.5.2 The ratio test 
if One 


π 1 
hence convergent. If r > 1 the series diverges; if r = 1 the test fails. 
For suppose r < 1 then it is possible to choose ἢ such that all the 
terms of the sequence |a,,1/a,| after the Nth lie in the range r — h to 
r+h. Choose ἢ = #(1 — r), then 


rth=Hltn<1 


is absolutely convergent and 


αν μὰ Ων τὰ 


Loe be etc. 


and <rt+h, 
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ΟΥ̓ [ἀν αι] <( + ADlayl, [αν «αἱ < (γ + Alans] < @ + A)? lay] ete. 


and > lan] = lay] + law+a] + [αν αἱ + - 
<layl{l+(r+A+( +h? +. . 4}. 
Hence |a,| is convergent by comparison with the geometric series 


στ λα ΝΠ Adding to this convergent series the finite number of 
terms |a,| + Ja, + |as| +... + |ay-1| and using Rule 2 (section 3. 4. 2) 


shows that > \a,{ is convergent. From section 3.2.2 it follows that 3a, 


is absolutely convergent and hence convergent. 
If r > 1 then, for n sufficiently large, |¢,41| > |a,| and the magnitude 
of the terms increase with n. Hence Lima, 4 0, and by the converse of 


n> @ 


Rule 1 (section 3.4.1) the series > a, is divergent. 
1 
Example (i) Prove that the series > n?/3" is convergent. 
1 
Solution Write a, = n?/3", then 


_ Mtl? 3* 


ad 2 Rtas ὦ 
35:11 * 2 


a, 


οο 
r = } and the series > a, is convergent. 
1 


οο 

Example (ii) Prove that the series Σ᾽ 1/n! converges. 
=0 

Solution Write a, = 1/n!, then 


An+1 = πὶ 
“(αὶ 1)! 
1 
πλεῖ as n->@. 
2 1 
Hence the series 5 a 1+1 Ἔ σι "ἜΤΙ 1. . is convergent. 
n=0 


Example (iii) Prove that the series Σ 2"/n is divergent. 
Solution Write a, = 2"/n, then 


aoe a n 
ay, ~n+1°2* 
2 
= Tain as A> Ww. 


.ο 
Hence > a, diverges. 
1 
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Example (iv) Consider the convergence or divergence of the series 
Sn + in? — 1). 
Solution Write a, = (n + 2)/(n? — 1), then 
ἀκα] n+3 πὲ- 

a, | n+2n'n+2 

( τ 3/n)Q — 1/n*) 
~ (+ 2jnyl + 2m) ~ 

That is, r = 1 and the test fails. This series diverges since 


n-+2 1 
ee aioe ae 
n? — 1 n 


1 as n>, 


> 1/n diverges, hence > a, diverges. 

Example (v) Consider the convergence or divergence of the series 
οὐ 

Σ ( i= 1)" +1 |n. 


Solution Write a, = (—1)"**/n 


Qn+1 = n 
a, | n+l 
i 1 
= —— > 
are Fake as n> 


That is, r = 1 and the test fails. This series converges as is proved in 
section 3.5.3 following. 


The examples (iv) and (v) emphasize the fact that when r = 1 there 
is a failure of the ratio test and that resort must be made to some 
other test. 


3.5.3 The alternating series 


Consider a series with alternating sign, i.e. a series in which the signs 
are alternatively positive and negative. Let the series be a, ~ a, + 
a3 — ας + —. . . where a; is positive for all i, Then if 


Lima, =0O and 4: > a,>43...>a,>0, 


n+ 


the series converges. 
Let sg, be the sum of the first 2” terms. 


Son = (αχ — 49) + (45 — a4) +. .. + Gan-1 — Gon). 
Each bracket is positive so that 825 increases steadily, or 
0-5,-5ς«-... 
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Also 


San = ὧι — {(@2 — 49) + (ας — 5) +. «© + Gan-1 το Gan—1) + Gant 
<a, since all the brackets are again positive. 


Hence 5.5, is monotonic increasing, bounded above and therefore tends 
to a limit. 


Also Son+1 = San + Gant1—> San aS N—-> oO 
since d2,,,;—>0, and thus so,,1 converges. Hence the sequence {s,} 
converges and so does the alternating series. For example, the series 


1—4+4—4+... converges since @4,—>0 as n—+oo and 
a, > ag > a3 >...>a,> 0 for all values of 7. 


3.6 POWER SERIES 
A series of the form 


Ag tax t+ agx*?+...= a,x" 


ims 


is called a power series in the variable x, where dp, 41, do, . . . are 
constants. If it converges it defines a function f(x) given by f(x) = 
Ay + ayx + agx? +.... 

The series always converges when x = 0. It may or may not converge 
for any other value of x. 


Example (i) The geometric series 1 + x + x? +... converges to 
1/(1 — x) provided |x] < 1 and diverges for all other values of x. 


Example (ii) The series 
2 3 


Χο ἐκ 
l+xt+3;+--- 


converges for all values of x since by the ratio test of section 3.5.2 


|x| 


“n+l 


xnth n! 
(n+ 1)!" x" 


for all values of x. 


—0O as n->o 


Example (iii) Consider the series 
x xe xt 
Rc Vie er Vai Ml say 
[| 


xntl n x 
@+b'x = απ Hl as n> @, 

Hence the series converges absolutely when |x| < 1 and diverges when 
|x| > 1. When x =1 the series becomes 1—4+4—}+... 
which converges (see section 3.5.3). When x = —I the series 
—1—~3}—4-—}-—... diverges by comparison with the harmonic 
series (see section 3.3.2). 


then 
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3.6.1 Radius of convergence 
Consider the power series 


ἄρ + aX tFagx? +... .. 


The number R, such that the power series converges absolutely when 
|x] < R and diverges when |x| > R, is called the radius of convergence 
of the series. The series may or may not converge when |x| = R. See 
example (iii) above. When the series is convergent everywhere, i.e. for 
all values of x, R is infinite. 


If SOQ) = Ag + aX + gx? +... ax" +..., 
then SX) = α, + 2agx +... Ἐπα,χητῖ ει, 


for every value of x inside the range |x| < R where R is the radius of 
convergence of the first series. That is, it is possible to differentiate a 
power series, whose sum is f(x), term by term within the interval 
defined by its radius of convergence, and the differentiated series has 
as its sum the derivative of f(x). Both series have the same radius of 
convergence. 
Hence, a power series with radius of convergence R may be differen- 
tiated or integrated term by term within its radius of convergence. 
The existence of R and the statements above require proofs that are 
beyond the scope of this book. 
Example (i) Show that the series 


Dn 
= axe St+Ft. os 
converges for all ae of x in the interval -1 <x <1. 


Solution Using the ratio test, 


An+1 


= Tt —>|x| as nm— oo. 


Hence the series is convergent provided |x| < 1 and is divergent when 

|x| > 1. When x = 1, the series is 1 ἘΞ ὁ ἘΠῚ ἘΠ. ᾿ς which is diver- 

gent (see section 3.3.2). When x = —1 the series is 
—l1+4¢-4+4-+4 

which is conditionally convergent by comparison with 
1-44+3-h+-. 

Thus the given series converges in the interval —l1<x<1 and 


diverges everywhere else. Its radius of convergence is R = 1, 
The result of differentiating this series term by term is 


l+x4x74..., 
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which is the geometric series which converges for —1 <x «1. The 
differentiated series converges at all interior points of the original 
region of convergence but not at the end point x = —1. 

On the other hand, integration of the geometric series term by term 
gives a series which converges at one new point, x = —1, at which 
the original (geometric) series diverged. 


Example (ii) The series 
ao x" x? x3 
Lat itetayt yt: ss 
converges for all values of x. See example (ii) of section 3.6. Differenti- 
ating this series term by term gives 


x? x8 
l+x+s,+3;+- ae 


which is the same series. Hence the function defined by this series is 
equal to its derivative and converges for all values of x. 


EXERCISE 3 


1 Do the following sequences have limits? If so, state these limits. 
Give reasons for your answer in each case. 


(i) 0, 1, 0, —4, 0, 4, 0, —4,. . ., O, 1/277, 0, —1/27**4,0,... 
(ii) 1, 4, 2, 4, 3,.. om. ; 


2 For each of the following a, determine whether the sequence {a,}, 
so defined, has a limit, and find that limit when it exists. 


2n +1 


ὦ τς (vi) sinn 
wi win ἐν 
(ii) 1+ os (viii) ats 
οὐ o> 4 
() 1+ (=p @ τὲ 


3 Find an expression for the nth partial sum of each of the following 
series. Find the sum of the series if it converges. 
re 2 
(i) +atgtyt- : . Ἔ χηξι Ἐ-: 


("5 


a 5 5 5 
G@)5—Z5+G- gt - . -+—jrar τ. 
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Git) 3+64+124...43Q"™454... 
4 State the value of each of the following: 


(i) Σ (-1)'*?x; (ii) > ( xt 2 
γΞ1 r=o \ 
5 Discuss the convergence or divergence of the following series. 
(ῦ x — $x? + x8 — 2χ eo + (H1)*tx4*”n +... 


(ii) > a, where a, = mn 
bee 1 
Gi) Foy where ας = FEET 


6 Discuss the convergence of the series whose general terms are 
(i) (—1)"x"n-*; ii) (sin nx)/n?; iii) (—1)"x?7/(2n)!. 


7 Determine whether the following series converge. 
μας Pe ὅν οι τῶ ie 
ὦ Σ απ νη) τ; Ὁ Σ τὰ; Gi) TH 
n=1 n=1 3 n=1 2 


οο 
8 Prove that, if the series δ᾽ ὦ, converges, then Lim a, Ξε 0, and 
n> 0 


show by an example that the converse is not necessarily true. 
Determine whether the following series ee or diverge 


oOF-rgT ὦ 3 Fore 
9 Find the radius of convergence of each of the following power 
series. 


(n!)8x* 


Oar OFF; w FF 


10 Prove that if > a, converges absolutely then ἸΣ a, sin nx converges 


for all real values of x. 

11 Prove that if a power series dy + a,x + a,x? +... converges 
when x = x, then it converges whenever |x| < |xo|. For what 
values of x do the following series converge? 


1 gee 
ji) Dire a + i* x"; (ii) Σ μῆχπ, 


4 
Exponential, Logarithmic, and 
Hyperbolic Functions 


4.1 THE EXPONENTIAL FUNCTION 


The general exponential functions may be defined in one of several 
ways. Their fundamental property is that the derivative of the function 
is proportional to the value of the function. That is they all satisfy 
equations, called differential equations, of the type 
mmo 

where c is a constant. 

In particular the standard exponential function is defined to be that 
function which satisfies the differential equation 


a (1) 
and such thaty = 1 when x=0. (2) 

In example (ii) of section 3.6.1 the function ye — = was shown to be a 
n=ol 


function which satisfies (1) and when x = 0 its oaiue is 1. Let this 
function be called E(x), i.e. 


AQ=l4+xt 545+... => 5 (3) 

From equations (1) and (2) ° 
ΕΟ) = E(x) (4) 
ΕΘ = 1 (5) 


4.1.1 Properties of E(x) 
E(x) satisfies the functional relation 


E(a)E(6) = Ε(α + β). (6) 
To prove this consider the function F(x) defined by 
F(x) = E(c — x)E(x) (7) 


where c is some constant. Differentiating equation (7) gives 
F'(x) = --Ε (ε — E(x) + Ele — x)E'(x) 
= -- ΕΑἰς — x)E(x) + Efe — x)E() using (4) 
= 0. 
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Hence F(x) = constant, since its derivative is zero 
or F(x) = k say, for all values of x 
or E(e — x)E(x) = k substituting from (7). 


Now, writing x = 0, 
E(Q)\E(O) = k, 


or E(c) = k since E(O) = 1 from (5). 
But k = E(c — x)E(x), 
hence E(c — x)E(x) = E(o), 


and writing c — x = a, x = β so thatc = α + β gives 


E(a)E(6) = Ε( + 8) 
which is equation (6). 
Replacing « by x and β by — x in equation (6), 


E(x)E(—x) = E(0) = 1 from (5) 


; 1 
1.6. E(—x) = EG (8) 
Repeated application of (6), setting a = B =... = x, gives 


{E(x)}" = E(X)EQ)E(x). . . E(x) to n factors 
=Ext+x+x+...+ x) (1 terms) 


i.e. {E(x)}" = E(nx) =n =1,2,3,.... (9) 
Hence if p and q are positive integers, setting x = ρίᾳ, n = q in (9) 
{E(p/q)}* = E(p) = {E()}? (10) 
hence E(p/q) = (Ε(}}» (11) 
and E(x) = {E(1)}*_ whenever x = p/q. (12) 


Since a rational number is any number which can be written in the form 
P/q where p and q are integers and φ + 0, equation (12) is true whenever 
x is a positive rational number. Writing —p for p in equation (11) 
shows that equation (12) is true when x is any rational number, positive 
or negative. 


4.1.2 The number e 
This is chosen to be E(1), the value of E(x) when x = 1, i.e. 
2 1 


1 1 
e= KI ΞῚ ἘΤ ἘΞ ++. : Da 


and its value may be calculated to any degree of accuracy. For example, 
if thirteen terms of the series are added, e = 2-718 281 8 correct to 
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seven decimal places. From.(12) 
{E(D}’ = EQ), 
so that οἵ = E(x), 


and the properties of E(x) developed in section 4.1.1. are written in 
terms of e* as follows: 


d 7 
ae (67) = e* from (1), 


x2 3 oo) 
faltxt 5+ Rt. -.= > = from (3), 
! ! “Ξοπὶ 


e“e® = εὐ = from (6), 


Fig. 4.1 


4.1.3 The graph of y = e* 
When x = 0, y = e° = 1, dy/dx = e° = 1, so that the gradient of the 


curve at (0, 1) is 1. 
When x is positive, e? > 1 + x so that e* > οὐ as x-> 0, and x 


increases steadily with x. 
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When x is negative, write x = —t, t>0 then e? =e-' = I/e! is 
always positive and as x — — οὐ, t—> οὐ, e* > 0. 

Hence as x increases, e? increases monotonically and is always 
positive. The graph is shown in Fig. 4.1. 

When the exponent is a lengthy expression it is sometimes convenient 
to write ο΄ as exp (x). 


4.2 THE LOGARITHMIC FUNCTION 

Since the function e” is continuous and monotonic increasing for all 
values of x it is possible to define its inverse function (see section 1.2). 
This is the /ogarithmic function defined by the relation y = log, x such 
that x = ὁ". 


Fig. 4.2 


Since x = 6" takes all positive values, log, x is defined for all positive 
values of x only. The subscript e, will be omitted in the subsequent text 
and the symbol log will be used to denote the logarithm to base e. (The 
abbreviation In is also used.) Any base other than e, will be specially 
designated as such. For example log:, x means logarithm to base 10, 
the common base used in computation. 

The graph of y = log x is derived by ‘reflecting’ the graph of y = e* 
in the line y = x and is shown in Fig. 4.2. The important facts to notice 
are log 1 = 0, since e° = 1; y is not defined for negative values of x; 
and as x > 0, y—> —o. 


4.3 LIMITS CONNECTED WITH e* AND log x 
As x — οὐ, 6 > οὐ more rapidly than any power of x, 1.6. 


<> @ as X00 (13) 


for any fixed positive value of ἡ. 
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The function e?/x” assumes the indeterminate form 00/00 as x > οὐ, 
but since 


x8 
falexty +3 jt. 


then, with m any integer greater than 7, 
m 


x 
’>—: 
m! 


m~ πὶ 


ev x 
Therefore — > —; 
x m 


which tends to infinity with x. Hence e7/x" — 0 


as x — οὐ, or e* ‘grows’ more rapidly than x”. 
Similarly it can be proved that e-* tends to zero more rapidly than 
any power of x as x — οὐ, 1.6. 


x"e-?¥—>0 as x70, (14) 


Similarly it may be proved that as x — οὐ, log x > oo more slowly 
than any power of x, or 


κε x 


(15) 


4.4 DIFFERENTIATION OF e* AND log x 
4.4.1 The derivative of e* 
From the definition of 67, 


a 3 
hence ae οἷ: = keX* where k is a constant, 
using function of a function rule. 


4.4.2 The derivative of log x 


If y=logx, x>0 
then x =e" 
and 5 =e! 
Hence - log x.= : 


As x increases, the derivative 1/x decreases, so that the gradient of the 
graph of y = log x decreases but is always positive (see Fig. 4.2). 
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More generally, 
d 1 
7x 108 |x| Ξε x #0, 


|x| is always positive so that log |x| is always defined. When x is 


negative, |x| = —x, log |x| = log (—x) so that 
4 jog |x| = Ζ tog (—x) 
1 
[eay 
“4 
=< 
Also £6 _f£e) 
gl fe) = Τὼ provided f(x) # 0, 


since, writing f(x) = z, & = ΓΟ), 
d d dz 
ax 198 1.0)] = | los |zl ἃς 


-Σγὼ 


. 60) 
~ fx)” 


4.4.3 The general exponential function 
The general exponential function is the function a’, a> 0. If y = a’, 
then 


log y = x loga 
and y = ezlose 
or a? = eee, 


Thus, instead of a* the simpler function e*!°%¢ can be used in calcula- 
tions. In particular the simpler form is more useful in differentiation, 
for example, 


d 


—@g=— ezloga 


dx 
= ets toga a>Od0 


= a’ log a. 
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To differentiate the inverse function, y = log, x, write x = αὐ = eve 


dx loga 
1 Ὁ δα loga 


Ξε xloga 
dy I 
and dx xloga 
d 1 
or Ζ: Ἰοβαχ = x Toga" a>0 


4.4.4 Logarithmic differentiation 


In the case of differentiation of a function which consists of a number 
of factors it is sometimes simpler to take the logarithm before differen- 
tiating. Thus if 


Y Ξκο)βο)βο). . fi) 
then for any value of x which gives y a non-zero value, 
In y = Inf,(x) + Inf.) + Infor) +. . . + In f(x) 


and differentiations with respect to x leads to 


Ld πῶ δὼ KO, Ke) 
γαῖ FAG) fle Jae ὦ 


Example (i) Differentiate x* where « is any real number and x > 0. 


Solution Now, x* = e*!°®? since x = e!°8 and so 


d 
PP Ξ a ae 


= eclosz μὴ 


Example (ii) Differentiate x*, x > 0. 
Solution Let y = x7 = etlez 
dy zlogz l 
7 ic et log: (logx + x.) 
= x? (log x + 1) 
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alternatively, let y = x* 


then log y = x log x 
d A= +1 
an i ae og x 


ad 
& = ylog x + 1) 


= x*(log x + 1). 
Example (iii) Differentiate y = log (log x). 
Solution 
Das 
x ~ logx’ x 
ΠῚ 
~ x log x. 


The function log (log x) is defined for (log x) > 0 and hence for x > 1. 
: . ὧν e77(x? + 1)(x + 1)? sin x 

Example (iv) Find x if y= Soe 

Solution Write 


log y = 2x + log (x? + 1) + 3 log (x + 1) + log sin x 
— log (x* + 4) — log (3x — 2) 


d ldy _ 2x 3 cos x 4x? 3 
an yar SSP RR a ari 


for any value of x for which y τέ 0. 


4.5 THE HYPERBOLIC FUNCTIONS 

There are certain combinations of the exponential functions e* and 
e~* which are called ‘hyperbolic functions’. They are used in solving 
certain engineering problems and are introduced now so that they may 
be used in the systematic study of integration in Chapter 6. 


4.5.1 Definitions 

The combinations $(e7 + e~*) and 3(e? — e~*) occur with such fre- 
quency that it is convenient to give special names to them. They have 
properties analogous to those of the circular functions cos x and sin x, 
and are called hyperbolic cosine of x or cosh x, and hyperbolic sine of x, 
or sinh x, respectively, i.e. 


cosh x = (εἴ + 6.7) 
sinh x = #(e7 — ς. 2]. 


(16) 
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The remaining hyperbolic functions are defined in terms of cosh x 
and sinh x as follows: 


sinhx εἴ —e-? 


tanh ΕΣ coshx” e*+e7 
δοῖεν: cosh x = e+e? 
sinhx e*?~—e* 
sech x = I = eee 
coshx εἴ 

cosech x = 2 


sinhx  e? —e-* 
4.5.2 Identities 
The basic identity of hyperbolic functions is 
cosh? x — sinh? x = 1 (17) 
which is proved as follows: 
cosh x + sinh x = 3(e? + e~*) + de® — e-*) = e?, 
cosh x — sinh x = }(e* + e~7) — #(e* — ee") = ες ““, 
Hence the left side of equation (17) is 
cosh? x — sinh? x = (cosh x + sinh x)(cosh x — sinh x) 
Ξε εἴ ο΄ Ὁ 
ΞΞ 
Dividing equation (17) by cosh? x gives 
1 — tanh? x = sech? x (17a) 
and dividing equation (17) by sinh? x gives 
coth? x — 1 = cosech? x. (17b) 
4.5.3 Addition formulae 
There are results analogous to those of the circular functions given by 
sinh (x + y) = sinh x cosh y + cosh x sinh y 
cosh (x + y) = cosh xcosh y + sinh x sinh y (note signs) 


These follow from the definitions of section 4.5.1 and are easily proved, 
starting with the right side. 


Also sinh 2x = 2 sinh x cosh x 
cosh 2x = cosh? x + sinh? x 
but 1 = cosh? x — sinh? x 


so that, adding and subtracting respectively, 
cosh? x = 3(1 + cosh 2x), sinh? x = (cosh 2x — 1). 
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4.5.4 Derivatives 
From the definitions in section 4.5.1, 
d d 
ee ΞΕ ee 
ax sinh x ae 4(e e*) 
= 3(e* + e-*) 


= cosh x. 


d 
Similarly ae cosh x = sinh x, 


ad tanh x = sech? x, 
dx 


d 
and — coth x = — cosech? x, 
dx 


each of which may be checked by the reader. 


4.5.5 Graphs of the hyperbolic functions 
(i) The properties of y = sinh x = 3(e* — e~*) are 


(a) x>0, sinhx>0 

(b) x = 0, sinh (0) = 0 

(c) 2 =coshx = fe? +e°7)>0 all x, 
ie. sinh x is an increasing function for all values of x. Also sinh x is 
an odd function since 

sinh (—x) = $(e7? — e*) = —sinh x 

(ii) The properties of y = cosh x = }(e? + 617) are 

(a) coshx >0O all x 


(b) cosh (0) = 1 


>0O whenx>0 


dy ; 
(©) dx sake Ξ 0 whenx <0. 


Hence when x is negative y decreases as x increases and when x is 
positive y increases as x increases. Also cosh x is an even function since 
cosh (—x) = #(e-7 + e”) = cosh x. 


The graphs of sinh x and cosh x are shown in Fig. 4.3. 
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y= sinh oc 


Fig. 4.3 


(iii) From the expression y = tanh x, 
Ν sinh x 
~ cosh x 
e*7—e?* 
et + e7? 
e?* — | 
et + 1 


e2t + 1 
it follows that 
(a) tanh (0) = 0, 
1 


dy 
(b) 5 = conry 9 9. alls 
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(c) tanh x increases as x increases for all values of x, 


2 
(d) tanh x = 1 Sea ot as Χ —> οὐ, 


+1 


and 
(e) tanh x < 1 for all values of x. 


Also, tanh x is an odd function since tanh (—x) = —tanh x. 
For sketch graphs of tanh x and coth x, which is derived from that 
of tanh x, see Fig. 4.4. 
y 


y=coth x 


y =coth x 


Fig. 4.4 


4.6 THE INVERSE HYPERBOLIC FUNCTIONS 


4.6.1 Range of the inverse hyperbolic functions 
Since x = sinh y is a monotonically increasing function for all values 
of y it is possible to define the inverse function y = sinh~* x (some- 
times written y = argsinh x) for all values of x and y. (Note, argsinh x 
but arcsin x.) 

Similarly y = cosh~* x (or argcosh x) is defined where x = cosh y 
is monotonic increasing, 1.6. when y > 0, and x > 1. 
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The function y = (πῃ ἢ x (or argtanh x) is defined for all values 
of y since x = tanh y is monotonic increasing in this range, and 
|x| <1. 


4.6.2 The logarithmic form of the inverse functions 
Gi) If y = sinh“? x, then 
sinh y = x = {(6" — ς΄"). 
Multiplying by 26" and rearranging, 
οὖν — 2xe¥ — 1 = 0. 
This is a quadratic equation in e” whose roots are given by 
e=xt V(x? + 1). 


Now e’ is positive so that x + +/(x? + 1) must be positive also. Since 
»/(x? + 1) > x the positive sign of the square root is chosen, and 


ev =x + V(x? + 1) 
or y = log [x + VQ? + J) 
or sinh~1 x = log [x + +/(x? + 1)] for all values of x. (18) 


(ii) Let y = cosh~} x, then x = cosh y, x > 1, y > Oand so sinh y > 0. 
Using the definitions (16), together with cosh? y — sinh? y = 1, 


6" = cosh y + sinh y = x + ν (3 — I). 
Hence y =cosh? x = log [x + VQ@?— J], x 21. (19) 
(iii) Let y = tanh~? x, then x = tanh y, |x| < 1, and 
6" ΞΕ e7¥ 
οὖν — I 
“ai 


or fie 
1-χ 


x= 


1+x 
Ι--χ 


1 
Similarly coth-! x = } log at |x| > 1. (21) 


Hence y = tanh7'x = } log >» (|x| <1. (20) 


4.6.3 Derivatives of the inverse hyperbolic functions 


These are simple algebraic functions similar to the derivatives of the 
inverse circular functions. 
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(i) Let y = sinh~* x, then x = sinh y 


ax 

ay = cosh y 
= +/(sinh? y + 1) 
= Vx? ἘΠ 


The positive square root is chosen since cosh y > 0 for all values of y. 
Hence 


[ane anes 
ἀχ V(x? + 1) 
d 1 
—gnhtl + =< 
or Ti sinh7* x = Vor + Ὁ all x. 
Similarly 
d 1 
ἐξ: ee 
Ae cosh7* x Vato τῇ x>1 
d 1 
ee Ἔχε  ος ς 
Gy [ἅπιΓ1 x = τ τ τὰ, |x| -1 
d —1 
ἘΣ -ι,χ,---Ξ- - 
ay coth~! x Yor |x] > 1. 
EXERCISE 4 


1 Differentiate with respect to x the following functions. 
(i) exp αὖ; (ii) exp (sec x); (iii) x* log (1 + χϑ); 
(iv) (1 + x?) log (1 + x7); (v) x tan7? x — log o/(1 + x); 
(vi) log [x + (x? — 3)]; (vii) sinh (2x7); (viii) x log x — x. 
2 Differentiate with respect to x the following functions: 
(i) χ; Gai) (sin x); (δ (ν᾽; 
(iv) xv; (v) δ πόθο (vi) 1019ε sinz 


3 Evaluate f’(x)/f(x) when f(x) = (1 + x)? and by successive differ- 
entiation calculate the values of /’(0), f”(0), and f"(0). 


4 Differentiate with respect to x the following functions: 
(i) (cosh x — cos x)/(sinh x — sin x); (ii) sech x; 
(iii) x tanh7? x + log νῷ — x?); (iv) tan~? (sinh x); 
(v) log tanh x. 
5 Sketch the curve y = tanh? x. 
6 Find é@u/éx and ou/ét if a, δ, and p are constants and 
u = ε΄ “5 cosh (pt — bx). 
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7 Differentiate the following functions: 
(i) 2’; 
(ii) exp (sin? x); 
(iii) av*, a> 0; 
(iv) sinh 3x cosh® x; 
(v) (x? + 1)9/2x — Gx + 2; 
(vi) log [C1 — x)/(. + x)]; 
(vii) x? exp (1/x); 
(viti)} a*x*, a> 0. 
8 Given that z = x", calculate 82Ζ ὃχ dy and 07z/éy?. 


9 Determine the nature of the stationary points of the function 
x*e-*, and show that for all values of x > 0, χε  " < (4/e)*. 


10 Determine the position and nature of the stationary point of the 
function cosh 2x — 4 cosh x. 


11 Prove that, if μ = x log (1 + x/y) + y log (1 - y/x) then 
ou, ou 
x5 +y ay = {. 


12 Differentiate (i) exp (x sin x); (ii) (log x)*; (iii) (x log x)/(1 + x); 
(iv) (log x)~*; (v) log (sec x + tan x). 


13 Prove that, if y = sin~* (e~*) then 


d*y dy 


2 
cos? y pat sin? y = = sin y cos y. 


dx = 
14 Show that, if A and B are constants and 
y = Acos (log x) + B sin (log x), 
d*y 


2 dy 
then VRatxaty=0. 


15 Show that sec x+ log (cos x) is an increasing function of x in the 


rangeO <x < 3° 


16 Prove that, if y = tan? (sinh x), then 
2 


d’y 2)" = 
πὰ ttany (Z = 0. 


17 Find dy/dx where log (x? + y?) = tan7! (y/x), x? + y? 4 0. 


18 Use the definitions of cosh x and sinh x to solve the simultaneous 
equations 
cosh x + cosh y = 4 


sinh x — sinh y = 2. 


Give your answers correct to three significant figures. 
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19 The temperature y° of a body ¢ seconds after it is placed in a certain 
medium is given by y = 27 + 636. 6, Find correct to two places 
of decimals, after how many seconds the rate of cooling of the body 
will have fallen to less than 1° per second. 


20 Solve the equation 
16 cosh x — 4sinh x = 19, 


giving the answers correct to three places of decimals. 


5 
The Mean Value Theorem, 
Taylor Series, and Further 
Partial Differentiation 


5.1 THE MEAN VALUE THEOREM 


5.1.1 Introduction 

It is geometrically intuitive that if a function y = f(x) is continuous in 
the interval a <x <b and has a derivative at each value of x in 
a< x «ὃ and the values of y at A where x = a and B where x = ὃ 
are equal, then there is at least one point between x = a and x = ὃ 
where the derivative is zero. Moreover, if the values of y at A and B 
are not equal and the other conditions hold, then there will be at least 
one point, x = c, between x = aand x = ὁ where the tangent is parallel 
to the chord AB. The sketch of functions y = f(x) in Fig. 5.1 demon- 
strates this property. 


The graph of y = f(x) may have a vertical tangent at either or both 
of the end-points of the interval, so that for example, the function 
/(a? — x’), ~a <x <a which represents a semi-circle satisfies the 
requirements. 

These two results may be stated precisely in the form of theorems 
as follows: 


(1) Rolle’s theorem Let y = f(x) be (i) continuous for a<x <b, 
(ii) differentiable for a < x < ὃ, and (iii) let f(@) = f(b). Then there is 
at least one number ¢ satisfying a < ς < ὃ and such that f’(c) = 0. 
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(2) The mean value theorem Let y = f(x) be (i) continuous for 
a<x <b, and (ii) differentiable for a < x < b, then there exists at 
least one number c such that 


fe) —f@ =(b-af(d and a<c<b, (1) 


= po O=La 


The fraction [f(b) — f(a@)]/(6 — a), which is the ratio of the increment 
in the value of the function to the increment in the value of x, measures 
the average rate of increase of the function in the interval b — a, and 
is represented by the gradient of the chord AB of Fig. 5.1. Hence the 
mean value theorem implies that the average or mean rate of increase 
in any interval is equal to the actual rate of increase at some point 
within the interval. For example, the average velocity of a moving point 
in any interval of time is equal to the actual velocity at some instant 
within the interval. Or, if the average speed of a moving vehicle is 
50 km.p.h. over a given interval of time, then at some instant during 
that interval the speedometer registers 50 km.p.h. 
Other ways of expressing equation (1) are as follows: 


and a<c<&, (2) 


(i) writing c = a + θ(Ὁ — a) where θ is some number between 0 and 
1, and writing a + h for ὁ so that b — a = fh, then (1) becomes 

Κατ ἢ) -- Καὶ =hf'(a+ 6h), 0<6<1. (3) 
(ii) setting 6x for A in (3) gives 

fla + 6x) — fla) = 6x f(a + 9 dx) 

or Of = dx f'(a+ 66x) 0<6<1 (4) 
where δ is the increment in the value of f due to an increment 6x 
in x when x = a. 


Certain deductions follow immediately from the mean value theorem, 
as follows: 


(i) if f(x) = 0 for all values of x such that a < x < b then f(x) is 
constant fora <x <b. 


(ii) if f(x) 0 for all x such that a << x < ὁ, then f(b) > f(a) and 
J(x) is strictly increasing fora <x <b. 


(iii) if m < f(x) < M for all x such that a < x < ὃ, then 
m(b — a) < f(b) — Κῶ < Mb — a), 
(iv) if fs) = f’2() for a < x < b then 
d 
FLAG) — fC] = 0, 


and f,(x) — fo(x) = constant fora <x <b. 
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5.1.2 The generalized form of the mean value theorem 


Suppose f(x) and g(x) are two functions which are continuous for 
a<x < band differentiable for a < x - b and that g(a) # g(6), then 
the function ®(x) defined by 

fo) -- Κῶ 


O(x) = fib) -- - δ) — g(x)}, 
οὐ =f) — 1) — 5G) = gq@ BO -- 809} 
satisfies the conditions of Rolle’s theorem. Hence there is at least one 
number c between a and ὁ such that ®’(c) = 0, ie. 
ἢ fo) -- Κῶ 
LO + 26) = ῶὅ 
Moreover if f’ and g’ are not simultaneously zero this may be written 
flo) - fa) _ fe) ‘S 
gb) — κ() ge) 
and this is the generalized or Cauchy form of the mean value theorem. 


g(c)=0 and a<c<b. 


5.1.3 Application to small errors 
Let y = f(x) satisfy the conditions of the mean value theorem, then 
from equation (4) 

dy = Ox f(a t+ 06x), O<49<1. (6) 
This is an exact equation, and gives the value of ὃν when 6x is known 
for some value x = a. If 6x is small then (6) may be used to compute 
an approximate change in the value of y corresponding to a small 
change ox in the value of x when x = a. For if 6x is small enough 
(ὦ is an approximation for f(a + @ 6x) so that 


dy = 6x f'(a) (7) 
(See equation (18) of section 2.4.) 


Example (i) If y = +/x, calculate the approximate error in the calcu- 
lation of y when x is wrongly given as !-02 instead of its correct value, 1. 


Solution Here f(x) = Vx, f(x) = 3x7 ', fC) = 4, 6x = 0-02 and 
ὃν = 0-01. 
Example (ii) The period T of a simple pendulum is given by 
T = 2nvi(lig) 
where / is the length of the pendulum. If an error of } per cent is made 


in measuring /, and g is accurate, find the corresponding percentage 
error in the value of 7. 


Solution This question involves percentages and it is well to remember 
that this implies percentage of something. 

Suppose the error in ἡρηρεσυ ἰ is positive, i.e. ἰ is measured too 
large, then the error δὶ is 4 per cent of /, or δ] = 3 per cent, and it is 
required to find 67/T. 
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From T = 20 v/(I/g) 
log T = log 27 + 4 log/ — 4 log g 


and differentiating implicitly with respect to / leads to 


l1dT 1 
Td UW 
Using dT = (dT/dl) δ᾽ then gives 
or 1dT 
T Td 
1 
= 5 δὶ 


= 4-4 per cent, 


and the approximate percentage error in T is }. 
If the measurement of / is too small, the error is negative and this 
simply means a change of sign of δ΄ in the working. 


5.1.4 ?H6pital’s rule 


Suppose f(x) and g(x) are two functions which are continuous in 
a<x <b, and that f’(x) and g’(x) exist for each x ina<x<b, 
that g'(x) #0 and that f(a) = g(a) = 0. Then I’H6pital’s rule states 


that 
Lim 70) ) enn δ] 
provided the second limit exists. For, since f(a) = g(a) = 0, 


fe) _foy-—fa _ fo 
50) ~ g@)—g@ ~~ g@ Mm “ὁ 


applying equation (5) of section 5.1.2. As x > a+, c—>a+ and 


cee (Cerrar CY 
Sg) eo nes 


provided this limit exists. 

This establishes the rule when x approaches a from above. The case 
when x approaches a from below is proved similarly and the two cases 
together give the result (8). 

The rule (δ) is used to determine Lim {f(x)/g(x)} where substitution 


(8) 


>a 
of x = ain the quotient leads to the expression 0/0 which is indetermin- 
ate. 


, Si 0 
Example (i) Lim a [= 31 
z>0 x 
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0 
Example (ii) En Be [= Ὶ 


The rule was established on the assumption that g’(x) * 0 in the 
open interval a < x < ὃ and hence is valid when g’(a) = 0. Thus if 
f‘(a) is also zero then equation (8) gives another indeterminate form 
0/0 and applying Hopital’s rule again, 

£@) _,, ὦ 
Per ΤῸ) 
provided the second limit exists. 

More generally, applying rule (8) m times 


‘(m) 
Lim? 2 Lim ΤῸ 
Ia g(x) Ia & (x) 
provided this limit exists. Conditions on f, g and their m — 1 deriva- 
tives may be deduced from considerations of the mean value theorem 
and conditions for ’H6pital’s rule. 

In practice the method consists in differentiating numerator and 
denominator separately so long as the form 0/0 appears at x = a. As 
soon as one or other of these derivatives is different from zero at 
x = a, the limit may be found. 


Example (ii) Lim vera ser [= Ἵ 


pp ey sO bare) Mica [= Al 


m0 2x 
- - 
ay en #1 + x) 
z>0 2 


= --1 


ge 
It can be proved that if f(x) ~ oo and g(x) o as x —a, then 
!'H6pital’s rule applies to the indeterminate form 00/00. Proof is 
beyond the scope of this book. 
The indeterminate forms οὐ. Ὁ and οὐ — οὐ may be evaluated in 
some cases by reducing them to a form in which l’H6pital’s rule applies. 
The forms 0°, 1°, 00°, as limits, may also be evaluated if the expression 


Ὁ 


can be reduced to a form in which l’H6pital’s rule applies, for example 
by considering the logarithm of the expression. 


Example (iv) Find the value of 


Li > 1 [- οὐ --οὦ as x>04 
paar sinx Xx ~ =o +0 as x->+0— 
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Solution The expression may be re-written as 


. Χ πο βίῃ χα | | 
Lin ———— Ξε “]» 
reo ΧϑΙΠΧ 0 
and applying I’H6pital’s rule twice the limit is 
: 1 — cos x 0 
πὸ -------- -- - 
zo SINX + Xx COS x 0 
‘ sin x 
1Π|:-.-------------- 
po 2COSX — xsinx 


I 
Θ 


Example (v) Find the value of Lim x*. 
z>0+ 


Solution Writing y = x’, then 
log y = x log x 
= log x/(1/x), 


and ne [- =| 
rot "ΙΧ © 


1/x 


= Lim 3 = Lim —-x=0. 
z>0+ —1/x 270+ 


Hence Lim logy = 0 

zO+ 

so that Lim y=e? = 1, 

στὸ: 

The assumption made in this last step is that 
Lim (log y) = log ( Lim y), 
z>0 z>0 

which can be proved since log y is continuous. 


: = ier lee) 
Example (vi) Evaluate Lim 3 [= Ξ] 


Solution Applying ’H6pital’s rule, the required limit 


ΟΣ + 2 et 
Lim ~ = Lim 
rok psa 3X 

a ee 

= Lim = 
«τῷ OX 

Ζ 

= Lim > 
row 6 


and there is no ‘limit’, or the ‘limit is infinity’. This shows that οἷ > oo 
more rapidly than does x°. See section 4.3 and example 8 at the end of 
this chapter. 
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5.2 TAYLOR SERIES 


The mean value theorem of section 5.1.1 states that if f(x) is continuous 
for a <x <b and has a derivative f’(x) fora < x < 6, then 


fe) —f@ = (Ὁ — afc) where a<c<b. 
Suppose now that f’(x) is continuous for a < x <b and that f(x) 
exists for a - x < δ. Consider the function defined by 


0,(x) — a ΕΣ ©,(@) = fib) -- fx) — (6 -- vf’) 


δ — x)? 
= 5 = a {fe} —f@ = (ὁ ἘΞ. ayf'(a)}. 


This function is zero when x = a and when x = 6 and satisfies the 
conditions of Rolle’s theorem. Its aa is 


“FO +/'C)— 6 - αὐ ὦ + F= GVO -- fla -- 6 — af'@), 


ἘΞ =} ) (Ὁ) — fla) — ὁ — af'@ — Kb — af"). 


Hence = exists a value c between a and b such that this derivative 
is Zero, i.e. 
J) =f@+ (ὦ -- af @ + (ὁ -- afc) and a<c<b. 
Extending this argument, suppose f“~*(x) is continuous for 
a<x<band af (x) exists for a < x - ὃ and consider the function 


b— 
o,0) — Ὁ Ξ ἘΝ 
᾿ b -ν χα ᾿ 
©, (x) = at S428 Say Oisen 5 oa γον. 
This function satisfies the conditions of Rolle’s theorem and hence its 


derivative is zero for at least one value of x = c between x = a and 
x = δ. This leads to the result ee as Taylor’s theorem that 


®,,(a) where 


f(b) = Κῶ + ὁ — af'a) τ τ fg ‘(ay +. 
+o eee fr cer: yf). and ax<c<b. ©) 


This may be written more pee and slightly differently, by writing 
x for ὃ and using the summation symbol, i.e. 


n-1 1 
LE) =f@ + 2 ait @e — αὐ" + Ral) (10) 
where R,(x) is the remainder after n terms and 


RG) = me — afc), a<c<x. 
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The special case a = 0 is known as Maclaurin’s theorem, and writing 
c = θχ where θ is some number between 0 and 1, 


fe) =f + "Sai xmpomr(Q) + FOX). (1 


The formulae (9), (10) and (11) are valid only if f(x) is defined for 
all numbers x between a and b. This assumption is not true for all 
functions, so that not every function can be expanded in a Taylor 
series. It is true for most of the more common functions. 

The form of the remainder R,(x) in (10) as (x — a)*f™(c)/n! and in 
(11) as x"f™(6x)/n! is known as the Lagrange form of the remainder. 
Even if f(x) exists for all values of x for a << x < ὃ and the Taylor 
series can be found for /(x), it is necessary to show that the difference 
between f(x) and the sum of the first n terms of the series, or R,(x) 
is “as small as we please’ as n — 00. 


5.3 POWER SERIES EXPANSIONS OF FUNCTIONS 


The problem of representing a function by a power series is an important 
one, and can be divided into two parts. First it is necessary to develop 
a general formula to give a power series representation of a function 
of x, i.e. to give a series of the type 


Ay + A,X + gx? ++... 4+ a,x" +... 


Secondly it is necessary to decide whether a given function can be 
expressed in this form and if so, for what range of values of x the series 
represents the function. The second part of the problem is by far the 
more difficult and is in general beyond the scope of this book. 


5.3.1 The Maclaurin series 


Let y = f(x) be the given function and suppose the power series 
expansion of the function can be found, i.e. 


SQ) = Ay + Aix + Agx? +. FART AL... (12) 
Let f(x) = do + ax + ax? +... tax’ +... for |[x}<R (13) 
(see section 3.6.1), 
then f(x) = a, + 2aax + 3agx? +... + γα,χῖτὶ 4+... 
$'(x) = 2a, + 3.2agx +... - γ( — lax? +... 


and ¢(x) =n!a, + terms containing powers of x, for all x for 
|x| < Rand for all integer values of n. Setting x = 0 in these equations 
leads to 


_ +40) (14) 
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Hence if f(x) has an expansion (12) it must be of the form 


fi) +x f'O) + = POA x fPO) +... (15) 


which is Maclaurin’s expansion (11) of section 5.2. This development 
shows no more than that a condition which must be satisfied by f(x) 
is that its derivatives of all orders exist for |x| < R. By considering 
the sequence of remainders R,(x), Ro(x),. . .. Ra(x), . . «, it is possible 
to estimate the remainder after n terms of the expansion, and if the 
sequence tends to zero it is possible to show that the series (15) con- 
verges and for which values of x. 


5.3.2 Some standard series 

(i) For f(x) = log (1 + x), f(0) = 0, then 
SOR) = (- )5 ἡ — DI + x), 
FO) = (—) a — 1), 


and substituting in equation (15), gives 


2 3 r 
log +x =x—-S + 5-4...450-DICN +... 


ees Xe x" 
= — «- --- - να τ ως 
Χπς Ὁ“ +..-.+(-) pote 
The remainder after terms of (15) is R, where 
nx” ὦ -- 1)! 
R(x) ma (-1) ni 4 + 6x)" 
and |R ΓΤ ἘΣ ἐΞ 0<6<1 
᾿ nil + 6x|” : 
P : ᾿ |x|" 
It is necessary to consider om nll + Ox|* fo<x<l, 
x” 1 
then 0< nl + θα)" < 2 
and Lim |R,(x)| = 0. 
n> o 


2 3 n 
Hence log(l+xs)=x—-F4+>—+...¢(-prtS +... 


3 
(16) 
is valid 0 <x <1. 
If x is negative, it may be proved that the series is valid for values 
of ἣν between 0 and —1 only, so that equation (16) is valid for 
-l<x<l. 


88 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 
(it) For f(x) = sin x, then 

f(x) = sin (e + ψηπ) 
and ΧΟ = sin dar. 


When is even, n = 2k, and f(0) = sinka = 0 
When n is odd, n = 2k + 1, and f™(0) = sin (2k + 1)ὲπ = (—1)*. 
The Maclaurin series for sin x is 


x xe (—1)*"1x22-1 
VS ett ope  Ὁ 


2n+1 


Σ x 2 

with Rona (xX) = @m+p! sin (0x + 4(2n + 1)πὴ 

and 0 < 6 < 1. This is the remainder after (2n + 1 terms of (11). 
Since |sin y| < 1, then 


Q2n4+1 
Renee 


(2n + 1)! 
for any fixed finite value of x, no matter how large, and hence 
Lim |Ran+1(x)| = 0. 
᾿ χϑ xd (—1)"-1x22-2 
Thus me 5.8. et aa τοῦς Τὰ 
for all values of x. 


(iii) Analagous methods may be used to show that 


x? x 
al ae ἘΧΈΞΠ a ae Ἐν. 00 —-oO<x¥< 0 
! πὶ 
Ξὴ χϑ xt (—1)"x?” 
cos x = tay eee aE τὴ ee ey 
ο «χ- οὦ 
ἃ Ὁ αλλ π α Ἐὰν [2] ἦτον να (fart... |x] <1 | 


and ἃ is any real number. 
These standard series and their radii of convergence should be 
remembered and used. For example, 
(—2x)? (-29 
ae aa 
--1 n-1 —2x n 
+ ἘΞ Soe a Pais 


log (( — 2x) = (—2x) -- 


is valid for -$ <x «-- 2. 
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(iv) To find the power series expansion of sin x about x = 4a. 
For f(x) =sinx then f(x) = sin(x +47) and f(47) = 1, 
‘(dar) = cos gn. If n = 2k then f?(47) = cos ἄπ = (—1)*, and if 
n= 2k +1, then f°**(4m) = cos (ka + 37) = 0. Using equation 
(10) of section 5.2 with a = ἐπ, the Taylor series expansion is 


(x — ἐπ)ΞὮἜἜ (x — ἐπ)" (-1)5 ἃ -- da) 
ἄπνοα τάδ aa eae Qn -- 2}} 
and [Roe] < SE 


which tends to zero as n tends to infinity for all values of x. 


5.4 TAYLOR SERIES FOR A FUNCTION OF TWO 
VARIABLES 


Let ὦ = f(x, y) be a function of x and y such that ὦ and its partial 
derivatives up to a required order are continuous at (Xo, Yo). Consider 
the value of f(xo +h, yo +k). Keeping the y coordinate yo + k 
constant and using Taylor’s theorem of section 5.2 with a= Xo, 
b=x+hso thatb—a=h, 


I(%o + h, Yo +k) = f(Xo, Yo +k)+ ᾿ξ ο, Yo +k) 
hh? 
+ 7 faloY+tkK+... (17) 
Again, applying Taylor’s theorem to the terms on the right of (17), 


k2 
So, Yo + k) = f(Xo; Yo) + Kf,(%o Yo) + 5 γ7)υνίχο;, Yt... 


k2 
S2(%0, Yo +k) = fi(Xos Yo) + kfy2(Xos Yo) + 5 Sivz%o, Yo)t... 


FSul%o, Yo + k) = firl(Xo, Yo) + Kfzy2(X0, Yo) Pac τοῦῖς 
Substituting these values in equation (17) as far as the second order 
terms gives 


flo + bi Yo + Ὁ = fle Ye) + Kilo 30) + 5 foalto. Yo) +s 
+ Hifl%0, γὼ + kf’, Yo) Ἔν ος 
h2 
+ a See(%o, Yo) Taga age (18) 


This equation expresses the value of the function f(x, y) at x = χο + A, 
y=yYo +k in terms of the values of the function and its partial 
derivatives up to the second order at (Xo, yo), together with powers of 
h =x — xo and k = y — yo. An expansion using higher derivatives is 
written more simply if the following ‘shorthand’ form is used, The 
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expression hf, + kf, is written (h δίδχ + k 8/@y)f and may be thought 
of as the operator (h δίδχ + k 8/8y) operating on Καὶ Then 


ὃ 8λ3 
(i ae +k 5) f 
a ὃ ; δ oF 
is a short form for (1 ae k =) operating on h ee k= 
The result is 


ὃν 
eSB) +05 (E08) 


Ox oy Oy \ Ox ὃν 
af of of 
2 δ 2-. 
or h xt + 2hk Ox by +k By? 


since the mixed partial derivatives are equal because they are continu- 
ous, and A, k are arbitrary constants. Similarly 


a\3 
δ) f= Whar + 3h*k fray + BAK frvy + fav. 


Hence the Taylor series expansion about the point (xo, yo) of the 
function f(x, y) is 


es ἢ ee 
Λα + In Ja ἘΔ = fl yo) ἘΣ 5 (hh + ke) Pso.90) + Ry 
(9) 


remembering that the partial derivatives on the right are all evaluated 
at the point (xo, yo), and where 


ὃ 


6 o\" 
wi(bag + 5) Sore + Bhs yo + 83k) 


and0<6,<1,0<4@ <1. 
If R, +0 as n-» οὐ, the series represents the function S(*, y) near 
(Xo; Yo): 


5.5 APPLICATION TO MAXIMUM AND MINIMUM 
Problems of the maximum and minimum of functions of two or more 
variables occur in many systems of physics, chemistry and engineering. 
In Chapter 2 conditions for the maximum and minimum of functions 
of one variable were developed. In this section the maximum and 
minimum of a function of two variables will be considered. Extension 
to functions of three or more variables may then be deduced. 


5.5.1 Definitions 

Consider the function w = f(x, y) defined in some region R of the 
(xy) plane. The function ὦ = f(x, y) has its greatest value or absolute 
maximum at a point (xo, yo) within or on the boundary of R if the value 
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of the function at this point is greater than the value of the function for 
all other points in the region, ie. if f(xo, Yo) > f(x, y) for all (x, y) in R. 

The function f(x, y) is said to have a Jocal or relative maximum, or 
simply a maximum at the point P (xo, yo) in R, if its value at P is greater 
than its value at any point in the immediate neighbourhood of P. In 
other words, the value of ὦ = f(x, y) at all points (x, y) near to (Χο, Yo) 
and in any direction from (Xo, yo), is less than the value wy = f(Xo, Yo). 
Thus there is a local maximum at (Xo, yo) if 


So + h, Yo + k) < f(Xo; Yo) 


provided A and k are small enough. 

An absolute minimum or least value and a local minimum value of f 
are similarly defined. 

If the function f(x, y) has either a local maximum or a local minimum 
value at (Xo, Yo) then it is said to have an extreme value at this point. 


5.5.2 Analytic conditions for an extreme value 


Suppose ὦ = f(x, y) has an extreme value at the point (Xo, yo) in R. 
Keeping y constant and equal to yo, f(x, yo) is a function of x only which 
has an extreme value at x = x, and hence its derivative is zero at 
X = Xo, 1.€. f7(Xo, Yo) = 0. Similarly, keeping x constant and equal to xo, 
f(Xo, y) is a function of y only which has an extreme value at y = yo 
and hence f,(xXo, Yo).= 0. 

Any point (x, y) at which f, = 0 and f, = 0 is called a stationary 
point of f, so that wherever f has an extreme value it has a stationary 
point. The converse is not true, since it is possible to find functions 
which have stationary points which do not give extreme values. For 
example, if f(x, y) = x*y?, then f, f;, and f, are all zero at the point 
(0, 0) but this point is not an extreme one because f, and f, are also 
zero at all points on both the x- and y-axes and fis positive everywhere 
else. 

The condition f, = 0, f, = 0 determines those values of (x, y) at 
which there is an extreme value of f but this condition is not sufficient. 
Let A be the point (x, yo) and B be (Χο + A, yo + Κ), where A and k 
are arbitrary, and let (Χο, Yo) = 0 = f,(X%o, Yo). Then from the Taylor 
expansion near the point 4, 


70 ὁ + ἢ, Yo + ἢ — Κἴχο; Yo) = HAfez + 2k fey + fry} + Ralh,k) 
(20) 

where the partial derivatives, 7,,, f,,, and f,, are evaluated at the point 
(Xo, Yo) and Rg is the remainder. This term Rg tends to zero with A and 
k and is a lower order of magnitude than any of the other terms of (20). 


It may be assumed for ἢ and k sufficiently small, that (20) may be 
replaced by 


So + A, Yo +k) — fos Yo) = ph? - 2ghk + rk? 
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where 2p = f,.(X0, γο); 24 = fiu(Xo, Yo), and 2r = fy(Xo Yo). 
The quadratic form may be written 


ph? + 2ghk + rk? = pth? + 2ghk/p + rk*/p} 


-ὐρναθ᾽ =e 


This is said to be positive definite for all values of A and k provided that 
pr —q? > 0 and p > 0 and is negative definite for all values of ἢ and 
k provided that pr—gq?>0O and p< 0, and is zero only when 
h=k=0. 

Hence if the quadratic form on the right of equation (20) is positive 
definite for all values of A and k, f(xo + ἢ, Yo + k) > f(%o, Yo) and 
So Yo) is a minimum value of f; if the quadratic form is negative 
definite for all values of A and k, f(xo + h, yo + k) <f(%o, yo) and 
f (Xo: Yo) is a maximum value of f. 

Moreover if h and k are small enough and the second order deriva- 
tives of f are continuous the condition 


f=0=fy Sufi τ ὔν 0 


at a point (xo, yo) means that fhas an extreme value at that point. This 
is a maximum when f,, < 0 and a minimum when f,, > 0. 

If at (Xo, Yo), fr = 0 =f, and fir fyy — £2, < 0, the binary quadratic 
on the right of equation (20) is said to be indefinite and its sign may be 
positive or negative according to the signs of A and k. This gives what 
is called a saddle point at (Xo, yo). 

If at (Xo, Yo), fc = 0 = f, and fi. fiy — 73, = 0, the quadratic form is 
said to be semi-definite and the nature of the stationary point is un- 
decided. Further terms of the Taylor series must then be considered, 
but in this book only conditions of second order terms are considered. 


Example (i) Investigate the extreme values of the function 
Κα, Y= χ +xytyt+x—44+5. 
Solution The function f(x, y) has partial derivatives 


fp=2x+ytl, fpsext2y—-—4 


and these are both zero at (—2, 3). 

The second order partial derivatives are all constant, fj, = 2, 
Suv = 2, fry = 1, and the expression which determines the nature of the 
stationary point at (—2, 3) is 


ἔων —f2, =4-—-1=3>0. 


The function therefore has a minimum at (—2, 3). Moreover this is its 
least value, since 


f—-2 +h, 3 +k) > f(—2, 3) for all h, k. 
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Example (ii) Investigate the extreme values of the function 
SOY) = αὐ + y?)? — 4x? + 2y?) + 13. 
Solution The function f(x, y) has partial derivatives 


Sr = 4x0? + y? — 2), Sy = 490? + y? — 4, 
Sur = 43x? + y? — 2), Suv = 4.3 + 3y? — 4), 
Sey = 8xy. 


There are five stationary points whose coordinates are found by 
solving the equations f, = 0 τῷ ἢ, and they are (0,0), (0, +2) and 
(+2, 0). 


At (0, 0), δεν — ἢν = (—8X(-16) — 0 > 0 
Suz < 0, hence fis a maximum at (0, 0). 
At (0, +2), Sasfuu — Fy = (8)GB2) — 9 > 0 


and f,, > 0, hence f has minima at (0, +2). 
At(+vV2,9), μεν — fiy = ((6)(--8) —0 < 0 
and f has saddle points at (+ +/2, 0). 


5.6 FURTHER PARTIAL DIFFERENTIATION 


5.6.1 The increment of a function of two variables 


Consider now an approximate value of the increment of a function of 
two variables. Let ὦ = f(x, y) and let its partial derivatives f, and f, 
be continuous functions of x and y throughout a region R in the (xy) 
plane containing (Xo, Yo). Then the increment 


δω = f(%o + Ox, Yo + Oy) — ἴσο, Yo) (21) 
is given by 


δω = f(Xo + ὄχ, Yo + dy) — f(Xo + OX, Yo) + Χο + 4x, Yo) — f(Xo, Yo) 
= 6a, + dw. 
where 
δω, = f(Xo + Ox, Yo + Oy) — f(Xo + OX, Yo) (x held constant) 
and dw, = [{χο + 6x, Yo) — f(Xo, Yo) (Ὁ held constant). 


Thus, holding y constant at y = yo and changing x from x, to xy + 6x 
and using the mean value theorem, 


δω, _ f(%o + 6x, Yo) — Χο, Yo) 
6x 6x 


is the value of (@@2/9X) 2,40) = fr(%1, Yo) for some x, between xy and 
Xo + Ox, ie. 


bag = flX1, Yo)Ox and χὺ < xX, < Xp + Ox. (22) 
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Since f, is a continuous function of x and y, its value at (x;, yo) is 
nearly the same as its value at (Xo, Yo) when 6x is small. Hence the 
difference denoted by «, and given by 


Κα, Yo) — ας, Yo) = & 


approaches zero as 6x approaches zero. Substituting for f(x, yo) in 
equation (22) gives 


δὼ = {filXo, Yo) + δ} dx, (23) 
and ¢, -- 0 when 6x — 0. 


Similarly δώ, = f,{xo + ὄχ, vo + θὸγ) ὃν = {f,(Xo, Vo) + Ea} ὃν (24) 


where 0 < 8 < 1 and ες. -> 0 when dy — 0. Substituting from equations 
(23) and (24) in dw = dm, + dag gives 


dw = Χο, Yo)Ox + fu, yo)dy + εἰ 0x + & ὃν (5) 


where δ, and ¢, both approach zero when 6x, ὃν approach zero. 
An analagous result holds for a function of any finite number of 
independent variables. For the case of a function of three variables 


ω = f(x, y, 2), 
dw = f, Ox + f, ὃν + fz 6z + ει Ox + δὰ Oy + ες 62 


where £1, 82, €3 > Ὁ when dx, dy, dz > 0. 


5.6.2 Total differentiation 


Let w = f(x, y) be differentiable at all points along some curve C in a 
region R in the (xy) plane and let the equation of C in terms of the 
parameter ¢ be x = x(t), y = y(t) where x and y are differentiable 
functions of 7. Then ὦ is a differentiable function of ¢ and 


do dwdx ϑὼω ὧν 
a er ae ay di (26) 
This follows from equation (25) where the parameter ¢ has value fy at 
the point (xo, yo) and when δέ —> 0, dx and dy both — 0 and e, and eg 
both — 0. 

The formula (26) may be extended to give the total derivative of a 


function of n variables f(x1, X2, X3,. ..» Xn) Where X1, Xo, X3,. . «, Xn 
are differentiable functions of the parameter t. Then 

qd of dx of dx. of dx. of dx, 

af 8 du Sf Be OD Pa Of ἄχ, (27) 


dt” @x, dt ° Ox, dt ᾿ ax, dt "°° a ax, dt 
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A particular case arises when f(x, y) is a constant. The equation (26) 


then gives 
Oe εὐ αν 
= ox at ἢ dy dt 
which leads to 
dy _dyldt___afidx 


dx dxidt~ — afféy 


The latter result is the derivative of y with respect to x when x and y 
are related in the implicit form f(x, y) = constant (see also section 
2.2.11). 


Example Find dy/dx given that x® + 4x?y? — 3γ5 = 2. Use this result 
to find the equation of the tangent to the curve x® + 4x?y? — 3y® = 2 
at the point (1, 1). 
Solution Here f(x, y) = χὃ + 4x?y® — 3γ5, and hence 

ἂν 5x* + 8xy? 


dx «12x®y? — 15y* 
The equation of the tangent to the curve at the point (1, 1) is given by 


v-v= (2) @-0 


or v-n=(+5)@-0 


which is 3y — 13x + 10 = 0. 


5.6.3 The chain rule 


No essential complication is introduced by considering the derivative 
of the function w = f(x, y) where x and y are functions of two para- 
meters r and 5, ie. x = x(r,s) and y = y(r, 5). Then when s is held 
constant and writing 


the result is the two formulae 
Cw = Ow Ox Ow ὃν 
ers 6x ar τ ὃν or 
ew = Om 0x | Ow ὃν 
as Ox ὃς ay as 


(28) 


More generally suppose ὦ = f(y1, Yas Yar + - +s Yn) and γι, Ya. Vas 
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- + +), are each functions of m variables x1, Χο, X3,...,Xm OF 
Vie = VilX1, Xa, Xa, - «9. ΧΩ), Κ = 1, 2,3,. . .,n, then 
ϑω Ow Oy, 


ox, “4 Oy, Ox, 29) 
This rule is the chain rule of differentiation and may be compared with 
the function of a function rule of section 2.2.9 of Chapter 2. 


Example Given that ὦ = = f(x, y) is transformed to F(r, 6) by substitu- 
ting x = rcos θ and y = r sin θ, find Of/éx and f/éy in terms of r, 6, 
OF/ér and OF/06. 


Solution First set of variables: x, y 
Second set of variables: r, 6 


(a) Of/ax and Af/éy are to be found 


Ox 
(b) = = cos 6, Ε = βἰπθ, 55 = r sin 0, a9 = 1 C08 8 
OF af ax δ day 
© a = ax Or t By Or 
of ox of ay 
4 55 - ax 88 + ὃν 36 
oF of : 
or ὃ, = 008 8 9. + sin 0% (i) 
r 
OF 8 = 
Fath 4 rent (ii) 


The question requires 0f/0x and Of/é@y which can be found in terms of 
r, 0, 6F/ér, and OF/00 by solving (i) and (ii) as simultaneous equations. 
Multiplying (i) by r cos 6 and (ii) by sin 6 and subtracting gives 


6 OF 1 OF 

L = cos — + sino S (30) 
and multiplying (i) by r sin θ and (ii) by cos 6 and adding gives 

ὃ oF 1 oF 

Hm sind + 7008 0 (31) 


Notice that when x = rcos 0, y = rsin 6 are substituted in f(x, y) 
a different function F(r, θ) is obtained. 
Equations (30) and (31) may be written in operator form, 


a a i ὃ 
i (32) 


(33) 
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Since r? = x? + y? and tan 6 = y/x, itis possible to evaluate af/ax 
directly as 
of OF ar oF 00 
ὃχ ar ox + δῆ dx 


and this is left as a useful exercise for the reader. 


5.6.4 Transformation of second order derivatives 


Many applications involving second order partial derivatives are 
simplified by a change of variable. Suppose ὦ = f(u, v) becomes 
F(x, y) on substitution of u = u(x, y), v = v(x, y) where x,y are 
independent variables and u, v are differentiable functions of x, y. 
Suppose also that the second order partial derivatives ἤν and f,, are 
continuous and so equal. ies equation (28) 


_ of mu, of w 


=o du Ox + 35 av ax (34) 
or the operator 

δ. 8 du, ὃ ov 35 

ὃχ Bu ax + Bo ox’ Ge) 


Using oo (35) on equation (34) 
ὃ (OF Ξ ( uf =) 
ax \ax du Ox + Ov Ox 
a ou of Ou u ov red ὃϑυ 
δὰ πα bu Ona ἢ ἐς (loo) = ax + By ax 
of Ou of Pv  du{a ou ὃ Ov 
eae ρα ῦ ἐπ {= cofew = + 5, (efleu) x 
ev (9 ou ὃ ov 
Of Ou, of Hv | BY (du)? , OF dudv | af Ὡἢ 
du Ox? * dv dx2 δι \dx dudv Ox Ox | Ov2\ax 
(36) 


Similarly, 


OF af Pu of a , af a) 5 Gf bu ov δ᾽ = sd 
Oy? ~ bu By + ὃν BA Be lB Budv dy By + δυξ G7) 
and 

OF Of Pu 8 Pv | Pf iudu OF (= ὃν bu 5) 


δχὸν bu axdy Ἢ 3p ov axdy δι ὃν ax ᾿ ϑιμδυ ὃν ὃχ = ax ay 


93 dv dv 
Dot ay ax (38) 
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The formulation of equations (37) and (38) are left as exercises for the 
reader. 


Example Find 07f/6x? if ὦ = f(x, y) = F(r, 6) where x = rcos θ and 
y=rsin 6. 


Solution Using operator (32) to find 0?f/éx? gives 


A (Z) = (ον —tsin 5) (cos 0 — sino) 
= cos 0 = (cos 955 - ρα 056) 
— 7 sin 6 ΟΣ + sin 0) 
= cos 0 (cos 055 + τρία 0 Ὁ — 7 sin 9) 
— Find (—sin dF + 00s 0 = — 7008 0 5 
— + sin 05) 
= cos? 9 τς — 2 cos 0 sin 0 + 5 — sin? 6 


2 Ι 
zat in@— + -sin? 6 — 
+ %cos 6 sin 65 88 ὯΝ τ sin θ or 
Similarly f,, and ἤν may be evaluated. 


EXERCISE 5 


1 By using the mean value theorem or otherwise, prove that for every 
positive number β 
BeF<tl—-eF%< Bf. 


2 If in the observed values of the variables x, y, z there are percentage 
errors of +1, —2, +1-5 respectively, find approximately the per- 
centage error in the value of ὦ calculated from the formula 


w = 27x®y3/4/z4, 


3 The area A of a triangle of sides of lengths a, b, ὁ is given by the 
formula A = 4/{s(s — a)(s — δ) — ΟΣ where s = 3a+ ὁ - ἡ. 
A plane triangular lamina of sides a = 5, ὃ = 6, c = 7 units is to 
be cut from a plane piece of metal. If the side ὁ is cut exactly, but 
small percentage errors «, y are made in the sides a, c respectively, 
show that the percentage error in the area is approximately 
(125a + 49y)/144. 


4 The volume of a cone is obtained by measuring its height ἢ and its 
semi-vertical angle « and then using the formula V = ἐπ tan? a. 
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If the measurements of h and « are subject to small percentage 
errors p;, and p, respectively, find an approximate expression for the 
resulting percentage error in V if « is approximately ὅπ. 


5 Evaluate the following limits: 


ean — 1] .. 7. sinx—t πα, log sin 
ὦ Lim; @) Lim ==; Gi) Lim “Ἔσται 
roy X°— | 20 x z+o log x 
. xsin-?x +x ge, BS α 
(iv) om fog cos x° (v) Lim x? + 1° (vi) ΓΙᾺ x — sinx 


6 Evaluate the following limits: 


7, gx—-@—)D+40-—1? ... .. x — sinh x 
@) Eun (x — 1)3 : ὦ eae 1 + x? — coshx 

7 Evaluate the following limits: 
ὦ Lim -- > i) Lim 
z->0 mtn 


Rae? 
(ii) Lim τ τας = οἷ) (iv) Lim ΝΣ ΩΝ 
2-0 a IT 
(v) Lim x{./(x? + 1) — x}. 
2 0 


8 Use l’H6pital’s rule to prove 


Lim=>=0 and Lim~®* =0 
gow 6 gro x 


(see section 4.3). 


9 The function f(x) is defined by f(x) = —-— when x + 0, f(0) = 1. 
y e*—1 


Use the definition of the derivative of a function as a limit to 
evaluate f’(0). 


10 Derive a power series expansion for log (1 + x) stating its range of 
validity. Show that 
tanh7* x = ἐ log [(lL+x/U —»x)], |x| <1. 


Hence find the first three non-zero terms in the expansion of 
(i) tanh~! x, (ii) tanh~? (sin x). 


11 Write down the series expansion of log (1 + x) and of cos x in 
ascending powers of x and show that, when x is small, 


log cos x = —4x? — x#/12.... 


12 Write down the Taylor expansion about x = 0 (Maclaurin expan- 
sion) for the following functions, stating the general term and the 
range of values of x for which your series is valid: 

(i) sin x; (ii) sin(@ + x); (iii) εἷς; 


(iv) (1 — χγτ"; (v) (1 — x)* when « is not a positive integer. 
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13 Write down the Maclaurin series for the function log (L + 4x?) 
stating the general term. For what values of x is the series valid? 


14 The Taylor series for a function of two variables about the point 
(a, b) may be expressed in the form 


2 1 ( 9g ay 
flat+h,b +k) = fla, δ) ἘΣ ait +k} fla,b). 


Write a + ἡ = x and b + k = yand obtain the Taylor series about 
the point (0, 0) for the function e* cos y as far as the terms of the 
second degree in x and y. 


15 Test the following functions for maxima, minima and saddle points: 
(i) x7 + y? — 2x +4y 48; (δ χϑ -- y? —2x + ἀν 4 8; 
(iti) 5 + 2x + 2y — 2x? — 2χγ — y?. 
16 Show that the function f(x, y) = x? + xy + y? +x -- ἄν + 9 has 
an absolute minimum at the point (—2, 3). 


17 Show that the function 2x* — 3x2y? + γ᾽ + 8x? + 3y? has sta- 
tionary values at the points (0, 0), (5, +6), (—5, +6). Investigate 
the second order properties of the function at each of these points. 


18 Determine the maxima and minima of the function 
S(%, y) = (ax? + by?) exp [—(x? + y”)] 
where a, ὃ are constants and 0 <a < ὃ. 


19 Determine the position and nature of maxima, minima, and saddle 

points on the surface 

z=x? + y? — ἐχ'. 
20 Show that the surface 
zZ=xX*—xy+y?+2x+2y—4 

has a minimum at the point (—2, —2, —8) and no other maximum 

or minimum. 
21 Determine the minimum value of the function 

3(x + y)® + 26 — yp)? — 8x. 


22 Show that the function f = (x? + y?)? + 2a%(x? — y?) where ais a 
constant, has one saddle point and two minima. 


23 Show that the function f(x, y) = χϑ + 2x?y — y? has a saddle 
point at (— 3, τ). 
24 Show that the function 


L(y) =  α + y)? + & — y)? — 12% 4+ y) 
has a minimum at the point (1, 1, —16). 
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25 Prove that, if x = rcos θ, y=rsin 6 and F(r, 6) =f(x, y), then 
1 1 
Fy + pee + 7a Fea = fis + fay: 


Hence, or otherwise, show that f(x, y) = ἰδη 1 (y/x) + log (x? + y?) 
satisfies the equation f,, + fyy = 0. 


26 Change the variables from x, y to u, v where w= x+y and 
v = x/(x + y) and show that x log (x + y) — x + tan74{x/(x + y)} 
satisfies the equation 


fe + Vfy = x log (x + y). 

27 By changing the independent variables from x, y, z to u, v, w where 
u=x,v=x + y/x, w =x + yz show that the equation 

χγύς + Wy — αν = [ἃ + yz — x°2)f, 
transforms to xyF,, = 0 where f(x, y, 2) = Flu, v, w). 

28 If z is a function of u and v where u = x? — y?, v = 2xy, show 
that (i) xz, + yzy = 2(uz, + vz,), (ii) γΖ, — XZy = 2(vZy — Uzy). 
Show also that if 

Zuy + Zu = 0, 


then Zar + Zyy = 0. 
29 Find dy/dx at the point (—1, 1) given that x* — 2y? + 3xy? + 6 
=0. 


30 Show that the tangent to the curve 2x + x log 2x + ye” = 1 at 
the point (4, 0) has the equation 2y + 6x = 3. 


6 
Integration 


6.1 THE INDEFINITE INTEGRAL 


The problem is to find a function y = F(x) such that its derivative in 
some given range, say a < x < ὁ is a given function f(x). In symbols 
this may be written 


: qd 
given 2 = fx), (1) 
find y = F(x). (2) 


For example, if f(x) = 2x then one solution is y = x?. On the other 
hand y = x? + 2, y = x? — 3m are also valid solutions, and so also is 
y = x? + C where C is any constant. 

An equation such as (1) which specifies the derivative as a function 
of x (or as a function of x and y) is called a differential equation. Differ- 
ential equations of more general types will be considered in Chapter 11. 

A function y = F(x) is called a solution of the differential equation 
(1) if, over the range a < x < b, F(x) is differentiable and 


ΕΟ) = fe). (3) 


In these circumstances F(x) is called an integral or a primitive of f(x) 
with respect to x. 

If F(x) is an integral of f(x) then F(x) + C is also an integral where 
C is any constant. For if equation (3) is satisfied, [F(x) + C]' = Ε΄) 
= f(x). Moreover all integrals of f(x) may be expressed in the form 
F(x) + C. For let F,(x) and F(x) be two functions which satisfy 
equation (1), and using deduction (iv) of section 5.1.1, then F,(x) 
and F(x) differ by a constant. The precise value of C cannot be 
determined without further data and hence C is called an arbitrary 
constant. This is indicated by writing 


| fl) dx =F) +C (4) 


where F(x) is any primitive of f(x) and C is an arbitrary constant. The 
symbol f is called an integral sign and J. . . dx is interpreted as the 
inverse of the symbol d/dx. 


6.2 STANDARD FORMS 


Integration as defined above requires the ability to guess the answer. 
The first requirement for skill in integration is a thorough knowledge 
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of differentiation. The following list contains formulae for derivatives 
together with the corresponding indefinite integral. Arbitrary constants 
of integration are omitted, a and n are constants and only the simplest 
form of integral is given. 


d n π-1 π a xt*t 

ax =x [- dx = π  --ἰ (5) 

d 1 dx 

f jog |x| =+ [ S = tos ix 6) 

d 

mene fe dx =e (7) 
a 

At = loga fe ax = og a’ a>OaZl (8) 

d. : 

Fy Sin x τε COS x J cos. dx = sin x (9) 

# = i | in x dx = 10 

1.995 x = — Sin x sin x dx = — cos x (10) 

d 2 

ay tan x = sec x sec? x dx = tan x (1) 

d 2 2 

ας Cot x = —cosec? x cosec? x dx = —cot x (12) 

d 1 dx 

— gant fea fe ne Ὡς eye) 

i ΤΑΡΕΣ ΠΕΞ Ia — 3H = sin” x, Ixl<i (43) 

gO odie al, SO τ geet 

a On * = Tae ice x (14) 

d 

Fx Cosh x = sinh x J sinh x ax = cosh x (15) 

d 

Gy Sinh x = cosh x J cost dx = sinh x (16) 

d 

ης tanh x = sech? x J sech® x dx = tanh x (17) 

d 

4. “οι x = —cosech? x J cosech? x ax = —coth x (18) 

ad. 1 dx 

pate PA igen pe og pe od 

τς sinh x Va +x) [πῷ- Β:Πῃ 1 x 


= log |x Ὁ νᾺ + x?)| (19) 


d 1 dx 
-1 = (ταίρι. πολυ τσερ ον τοττιλῆ ---------- - -.- fa 
po a Vx? — ἡ) [ Vx? -- 1 eee 
= log |x + VG? — 1), x > 1 (20) 
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d 1 dx 
eee -1 =, oe 5Ξ ΤΣ, oo τττῈ τς σιν -1 
7, tanh ΧΞ ee [τξ τα x 
1 ᾿ 
= }log eat x?-<1 (21) 
d -Ι dx 
wes -1ly uw pA ον δ Ξορο a4 
“οἷα = Sy [-Ξ . -- coth"? x 
—1 
= } log Ξ ἢ x2>1 (22) 
6.2.1 Rules 


The following rules help to reduce the amount of guesswork in integra- 
tion in many cases and lead to further standard forms. Arbitrary 
constants of integration are omitted. 
(a) It follows from section 2.2.5 that 


[rar + bg(x)} dx = af i) dx + [a dx (23) 
where a and ὦ are given constants. This rule may be extended to the 


integral of the sum of any finite number of functions. 
(b) From equation (9) of section 2.2.9, 


[χω + b) ἀχ τ : Fax + b), a#0 (24) 
where a and ὁ are known constants and F’(x) = f(x). 


Example (i) i sin (3x + 2) dx = -- cos (3x 4+ 2). 


Example (ii) fox + 3). τ dx = 34/(5x + 3). 


These results follow from equation (24) directly and compare with 
examples (i) and (ii) of section 2.2.9. 

The following are some standard forms which follow from equation 
(24), a, b and n are constants and a + 0. 


oy ee δ ΤῊ 
[τὸ dx τε get) , n#z—!l (25) 
dx 1 
| pH Flos lar + 41 ax+b<0 (26) 
1 
fe dx =e (27) 


[ cos (ax + δ) dx = * sin (ax + b) (28) 
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| qe (:. ae (29) 
ΣΝ ztan-* (3) (0) 
loss Tere Ἢ 3 = sinh ΠΗ = log |x + να + x?)| (31) 
[το 2 - οὐ (2) = log ix + νοῦ -- ὧι, 
“]>1. 62) 
(c) Since 


d 
Fe Soo * = @ ΟΣ ΓΟ, 


using the rule for differentiating the function of a function, then, when 
πῸ --Ἱ, 


[ verre e = Sues (33) 
and when n = —1, 
ΤῸ ax = log foo 89 


provided f(x) τὸ 0. 


Example (i) [: Ose 


sin® 2x 


dx = + [Gin 2x)" ὃ (2 cos 2x) dx 


= + (in 2χ) ὃ (sin 2x)’ dx 
= #(—})(sin 2x) *. 
Example (ii) [ tan dx = | a 


cos x 
(cos x)’ 


cos x 


dx 


—log |cos x| 
i.e. | tan x dx = log |sec x|. (35) 


Example (iii) i cosec x dx = Ξ 


= | dx 
~~ J 2sin $x cos 4x 
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Multiplying numerator and denominator of the integrand by } sec? 4x 
makes this integrable in the form of equation (34), so that 


1 21 
4 sec? 4x 
cosec x dx = oe dx 
tan gx 


= log |tan 4x]. (36) 


Example (iv) [sec x dx may be evaluated in one of two ways. 


First, using equation (36) and the identity cos x = sin (x + 47), 
[sec xdx = Jeosee (x + $n) dx = log |tan(4x -- 8] (37) 


Second, writing 


eis sec x (tanx + secx) (tanx + sec x)’ 
cx = o_O = -----.".----:.:.:ο0ο:0:..: 
(tan x + sec x) (tan x + sec x) 


and using equation (34), 


[sec x dx = log |tan x + sec x| (38) 
Example (v) [os xdx = [το xcos x dx =a — sin? x) cos x dx 


= [os x dx — | (sin x)?(sin x)’ dx 
= sin x — } sin? x. 


This method applies whenever an odd power of sin x or cos x is to be 
integrated. 


Example (vi) feos x sin" x dx, m,n integers. 


If one of m or n is an odd integer the method of example (v) may be 
used to evaluate the integral. 

If both m and n are even integers or if one is an even integer and the 
other is zero, it is necessary to use the identities cos? A = 3(1 + cos 2A) 
and sin? A = 4(1 — cos 24). 

For example, if m = 0, n = 2, 


[sine » dx = {ac — cos 2x) dx = 4x — 1 sin 2x, 


and if m = 2, ἢ = 4, 
cos? x sin* x = (2 — cos 2x — 2 cos 4x + cos 6x)/32 


and the integral is vex — εἷς sin 2x — οἷς sin 4x + re sin 6x. 
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Example (vii) To integrate 
sin mx Cos Hx, sin mx sin nx, and cos mx cos nx, 


where m and n are integers it is necessary to use the trigonometrical 
formulae 


sin A cos B = }{[sin(A — B) + sin(A + B)], 
sin A sin B = 3[cos (A — B) — cos (A + B)], 
cos A cos B = 3[cos (A — B) + cos(A + B)]. 


Thus for example, if m 4 ἢ, 
| cos mx cos nx dx = [ 4[cos (m — n)x + cos (m + n)x] dx 


_ , fsin nae ee) 
at m—n m+n |’ 


and if m = n, 


| cose nx dx = | 4(1 + cos 2nx) dx = 3[x + (sin 2nx)/(2n)]. 


6.3 THE DEFINITE INTEGRAL 

Let f(x) be defined in the interval a < x < band let F(x) be continuous 
in a <x <b and such that F’(x) = f(x) ina < x < ὃ. Then from the 
mean value theorem (1) of section 5.1.1, 


F(b) — F(@) = (ὁ — a) F(x) = (6 — αὐ) anda<x<b, 
This is written 


ic dx = F(b) — F(a) (39) 


and is defined to be the definite integral of f(x) in the interval from 
x=atox=b, 


6.3.1 Geometrical interpretation of the definite integral 


Suppose the function f(x) is positive and continuous in a<x <b. 
Using an intuitive idea of area, consider the area enclosed by the curve 
y = f(x), the x-axis and the ordinates at a and x, where x is some 
point between a and ὁ. This area is a function of x and may be written 
A(x). The problem is to show that A’(x) = f(x), a<x <6. 

Using Fig. 6.1, let Ο be the point between a and ὃ on the x-axis whose 
x-coordinate is x +h, then the ordinate QR is f(x + h). The area 
between the curve, the x-axis, and the ordinates at a and x + his then 
A(x + ἢ) and the area, shaded in Fig. 6.1 is A(x + A) — A(x). 

Let M be the greatest and m the least value of f(x) in the range 
from x to x + h. Then, using the lettering of Fig. 6.1, 
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area of rectangle PORS < A(x + ἢ) — A(x) < area of rectangle POTU 
Le. mh < A(x + ἰὴ — A(x) < Mh 


= 46 + ἢ - 46) 


ΟΓ 


Now, as h->0, x being held fixed, both m and M— f(x), since f is 
continuous, and hence 


om 40+ 2 -- A(x) 


re ae 


or A'(x) = [0]. (40) 


Let F(x) be any indefinite integral of f(x) so that F’(x) = f(x). Since 
F(x) and A(x) have the same derivative they differ by a constant, i.e. 
A(x) — F(x) = k, where k is some constant. When x = a, A(a) = 0, 
hence k = —F(q) and 


A(x) = F(x) — F@), 
or A(x) = ii S(x) dx = F(x) — FQ), (41) 


using the notation of (39). 

It is usual to denote F(x) — F(a) as [F(x)]z where the square bracket 
with its limits of integration a and x, is an instruction to substitute and 
subtract. 

The total area enclosed by the curve y = f(x), the x-axis and the 

ὃ 


ordinates at x = a and x = bis hence F(b) — F(a) or } f(x) dx and is 
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usually called ‘the area under the curve’ from x = a to x = b. More- 
over it follows that 


᾿ Τὴ ἂχ = [ (x) dx + [ Ια) wherea<c<b, (42) 


6.3.2 The ‘sign’ of an area 
When f(x) is negative and ὁ > a > 0, since 


b ὃ 
Ϊ —f(x) dx = Ε(ῶ -- F(b) = -- i Ss) ax, 


then the integral is negative. This is explained geometrically by attaching 
a negative sign to any area below the x-axis. 


Fig. 6.2 


More generally if y = f(x) takes positive and negative values in the 
interval a <x <6 as does the function represented by the curve in 
Fig. 6.2 where the function is negative for ὁ « x < d and positive 
elsewhere in the interval, from (42), 


[ fad = ik T(x) dx + [re dx + [ fx) dx 


= the algebraic sum of the separate integrals. 
Hence the area enclosed between the curve and the x-axis and the 
ordinates at x = a, x = ὁ is the sum of the moduli of the separate 
integrals, 1.6. 


area = [ f(x) dx 


+ 


[ 7) dx| + If J (x) dx 


π wT 
Example [ cos x dx = [sin x| = sin7 — sin0 = 0, 


Ὁ 0 
but the area enclosed between the curve y = cos x, the x-axis, and the 
ordinates at x = O and x = π 15 


ἐπ π ἐπ 
[ cos xdx -- | cosxdy = 2 { cos x dx = 2. 
0 7 [ 
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6.4 INTEGRATION OF THE RECIPROCAL OF A 
QUADRATIC 


6.4.1 The integrand of the form 1/(x? + px + 4) 
Consider the integral 


ax : 
lace P.q given constants. (43) 


Since x? + px + q = (x + 4p)? + q — 4p’, the form of the integral 
depends on the value of ¢g — 4p? and there are three cases to consider: 


(a) If p? = 4q, x? + px +q = (x + 4p)? and 


| ag Rs fa ON i 5 

+ pxtq απ (Ὁ 4p)’ 

(b) if p? < 44, (x + 4p)? + q — 4p? = (x — a)? + f? where « and β 
are real. Then from (30) of section 6.2.1 


dx 1 x— a 
----..---.Θ0Θ:.ὲ -Ξ -------.----  -- -ἰ -1 . 
jasper Jemappes ὦ 
(c) if p? > 44, x? + px + q has real factors and the integral may be 
evaluated using partial fractions. Suppose 


x? + px + q = (x — a)(x — β) 
where « and β are real, then 


1 - ὰ -- βὶ Wa -- p) 
α--ἡα -β χ-α x—B 
dx dx 1 —« 
πρὶ πα 3 


6.4.2 The integrand of the form (ax + 5)/(x? + px + 9) 
Consider the integral 
(ax + b) dx ; 
x4 px tq’ a, ὃ, p, 4 given real constants. 
(a) It is possible to choose constants A, ~ such that 
ax +b=AQx+p)+ 4, 


i.e. A (derivative of the denominator) + μ. Hence A = 4a, μ = ὃ — fap, 
then 
(ax + b) dx [ (2x + p) dx { dx 
ες ΤῊ 


The first integral is A log |x? + px + 4] and the second has already 
been dealt with in section 6.4.1. 
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(Ὁ) When the denominator x? + px + q can be resolved into real, 
distinct factors it is easier to write (ax + b)/(x? + px + q) in its partial 
fractions immediately. Suppose x? + px + 4 = (x — a)(x — A), α, B 
real, then write 


ax +b _ A Es B 
α -- αὐα --β᾽,Γ, χ--α  χ-β 
where A, B are constants to be found. The constants may be found by 


one of three methods, (i) the ‘cover-up’ method; (ii) the ‘identity’ 
method; (iii) the ‘comparison’ method, which are explained as follows: 


(47) 


(i) The ‘cover-up’ method 
Multiply both sides of (47) by x — « to give 


and set x =a so that A = (ax + b)/(« — f). This is equivalent to 
‘covering-up’ the factor x — α on the left of (47) and writing x = « 
in the remaining fraction to give the value of A. Similarly B is found 
to be (af + 5)/(8 — «) when the factor x — β is covered up on the 
left of (47) and β substituted for x in the remaining fraction. 

This method is very quick and always works when the factors in the 
denominator are simple distinct linear factors. 


(ii) The ‘identity’ method 
Consider the identity, supposed true for all values of x, 

ax +b = A(x — β) + B(x — a) (48) 
obtained from (47) by multiplying by (x — «)(x — B). In (48) set 
x = aand A = (aa + b)/(« — ), set x = Band B = (αβ + b)/(B — «). 
(iii) The ‘comparison’ method 
Write (48) in the form 

ax + ὃ =(A + B)x + (—AB — Ba), 


and equate coefficients of x and constant terms to give two simultaneous 
equations, a = A + Band ὃ = —Af — Βα from which A and B are 
found. 


6.4.3 The integrand of the form f(x)/(x? + px + 4) 


When the function f(x) is a polynomial of degree two or more it is 
necessary to divide by the denominator until the integral is of the form 
(ax + b)/(x* + px + 4). The integral is then evaluated using one of 
the methods outlined above. 
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. dx x 
Exanple ᾧ [5a | T=HEFD 


= first res 
=F log xt? 


ὃς dx —1 
Bxumple Wi) Ϊ 4x2 — 4x +17 JQx—1?~ γῶχ-- ἢ 


τς dx ad 
Example (iii) [πττ-τ ἢ = lees 
af ks ad] 
= 73 tan 3 
Example (iv) 


(l—x)dx [ -- ἰῶχ -- 1) 
t—xtx? J l—-x+x 
= Stan (FA) — tlog(l — x + x’). 
(2x — 3) dx 3 5/3 
Example (v) ($25) - [5:33 


= $log |x -- 2] + flog |x + 1 


dx 


Example (vi) 
| x? dx -{{ 2x + 3 4 
χε 2χ- 3 ~ 4x +3)” 
{-- _ 2x+2 _ 1 \ 
ἢ Eo 43° & +)? +2) 


= x — log (x? + 2x + 3) — Sy tan ΓΞ ἢ 


6.5 INTEGRATION OF RATIONAL FUNCTIONS 
Consider the integration of the rational function 

P(x) 

R(x) 
where P(x) and R(x) are polynomials in x, R(x) is not necessarily a 


quadratic and the factors of R(x) are known. The procedure is as 
follows: 


(49) 
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(a) if the degree of P(x) is not less than the degree of R(x), divide the 
denominator into the numerator until the remainder is of the form 


SO) 
R(x) 


where the degree of /(x) is less than the degree of R(x); 
(b) separate the remainder (50) into its partial fractions. Let R(x) = 
(x — a(x — B)"(x? + px + 9)", where m and n are positive integers, 
so that (50) becomes 

fe) 


(x — a(x — B)"(x? + px + 4)" 
and the degree of f(x) is less than m + 2n + 1. Then the following rules 
apply: 
(i) to each distinct linear factor such as x — a, there corresponds a 
fraction A/(x — «) where A is a constant to be found by one of the 
methods of section 6.4.2 (b); 


(ii) to each repeated factor such as (x — f)™ there corresponds m 
fractions of the form 


a SBN τς δ 
x—B* α-β'  α-- BP eB” 


(50) 


and B,, i= 1,2,3,. . .,m are constants to be found. The ‘cover-up’ 
method may be used to find the value of Βα; 

(iii) to each quadratic factor such as (x? + px + 4)" there correspond 
fractions of the form 


C.x + D, C2x + De + ey Ο,χ + Da 
xA+ pxtq' X?+pxtg?* Π OP +pxtq” 
where C,, D;, i = 1, 2,3,. . .,m are constants to be found. 


Example Express 
x*—x?—1 


IO = Gx DEF FFD 


as a sum of partial fractions and hence evaluate [ I(x) ax. 
Solution Since the degree of the numerator is greater than the degree 
of the denominator, it is necessary to divide so that 
6x — 1 
(x — 2)(x? + 2x + 3) 


Since x? + 2x + 3 will not factorize into real factors, partial fractions 
are given by 


fo) = x4 


6x — 1 A Bx+C 


ἃ — 2? + ox + 3) χ- χ᾽ χε χα 3 


14 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


Then A = 1 by the ‘cover-up’ method, and the ‘comparison’ method 
leads to values B = ~1, C = 2 and 


1 x—2 
IQS e =o eS 


Hence 
5 1 4Qx + 2) 3 
[rac = {x +45 - gO? + elo 
= 4x? + log |x — 2] — ὁ log (x? + 2x + 3) 


3+/2 _, {x+1 
εὐ tan” ( + 


6.6 INTEGRATION OF FUNCTIONS WITH 
IRRATIONAL DENOMINATOR 


Consider the integration of functions of the type 
ax +5 


Veet + px +4) a, ὃ, p,q real constants. 


6.6.1 Numerator unity 
The method of evaluating 


i dx 
V(x? + px + 4) 


is very similar to that of section 6.4.1, that is, by completing the square 
of the quadratic x? + px + q and using one of the standard forms 
indicated by (29), (31) or (32). 


Example Evaluate | 
Pe 8 dx 
Js νῷ — 4x + 13) 
Solution The quadratic x? — 4x + 13 = 9 + (x — 2)? and hence 


x 


8 d. 
r= VO+@—23 


— | spp-2 (51 2}7]" 
= [sinh (5 )I. 
= sinh7? 2 — sinh-' 1 


= log {2 + ¥(2? + 1} — log (1 + ν 5 + 1} 


2+ V5 
1+ 2 


= log 
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6.6.2 Numerator in linear form 

The method of evaluating 
(ax + δ) 

V(x? + px + Ἢ 

is very similar to that of section 6.4.2 (a). 


ΓΞ loess (x — 2) dx 
VE — 4x — 7) 
Solution The derivative of 5 — 4x — x? is —4 — 2x and the numerator 
of the integrand is written x — 2 = —}(—4 — 2x) — 4, so that 
1 (—4 — 2x) Ix 
aa -- ἡ τ τς 
= --,κνῷ -- 4x -- x’) -- 4 ςἰη- 1 A(x + 2)}. 


x, a, b, p,q given, real constants, 


Example Evaluate 


6.7 INTEGRATION BY PARTS 


The method of integration called integration by parts depends on the 
formula for the differentiation of a product, namely that 


d(uv) _ τ du 
dx Gere x 


where wu and v are differentiable functions of x. 
Suppose f and g are differentiable functions of x and that 


G(x) = | g(x) dx, then the formula for integration by parts is 


fe dx = 75 - lor dx (51) 


The derivative of the right side of this equation is fg + 76 — Gf’ 
which is fg, the derivative of the left side and this proves (51). 

For this method to be of use it is necessary that the integral on 
the right of (51) should be easier to evaluate than the given integral. 
Note that of the two terms f and g the latter is integrated in each term 
on the right of (51). 


1 
Example (i) Evaluate J = I xe%* dx. 
Ὁ 


Solution Both functions x and e** have simple integrals. If x is chosen 
as the function to be integrated, the power of x increases and the 
resulting integral is no simpler. If x is chosen as the function to be 
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differentiated the resulting integral is simpler than the original. Hence, 
choosing f = x, g = e**, then G = }e** and 
1 
I= [ xe dx 
Ὁ 
1 


Ξε [te]: -- [ κε3: dx 


«Ὁ 


= [axe - ΩΝ 


= (268 + 1)/9. 


This method of integration involves the product of two functions 
and it is necessary to integrate one of the functions and differentiate 
the other. At first the choice of function to be integrated may appear 
difficult. Once one of the functions is chosen to be integrated then this 
function must continue to be integrated should it be necessary to 
perform further integration by parts. 


Example (ii) Evaluate J = | 6525 sin 3x dx. 
Solution Choose g = e** then f = sin 3x and 


I = $e? sin 3x — [em (3 cos 3x) dx. 


Using formula (51) again for fe: cos 3x dx and again choosing 


g = e*, leads to 
[ οὔτ cos 3x dx = he** cos 3x — | 4e7*(—3 sin 3x) dx. 


Thus I = 4e?* sin 3x — ge?" cos 3x — 91/4 
and transposing the term containing J on the right to the left gives 
1317/4 = 4e?* sin 3x — 3e?* cos 3x, 
and I = e*(2 sin 3x — 3 cos 3x)/13 + an arbitrary constant. 
It is recommended that the reader evaluates this integral by choosing 
5 = sin 3x and f = e”*. 


Formula (51) is used to integrate functions whose derivative is known 
and whose integral is not of standard form, such as log x, tan-? x etc. 


Example (iii) Evaluate | log x dx. 
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Solution The integrand is considered as the product of two functions, 
J = log x and g = 1. Then 


flog x ax =x log x - fxars dx 


= x log x — x + an arbitrary constant. 


1 


Example (iv) Evaluate Ϊ tan“! x dx. 


Ὁ 


Solution Let f = tan"! x, g = 1, then 
1 1 1 
i tan-1 x dx = E tan? x] -- I {x/(. + x?)} dx 
0 0 Ὁ 


= [xtan-* x — # tog ( +29] 
= tan7? 1 — $log2 
= ζπ — $log2. 


6.8 REDUCTION FORMULAE 


The technique of integrating by parts repeatedly is often necessary to 
evaluate an integral such as 


1 
0 


1, =[ex dx. (32) 

Integrating by parts gives 
I, = εἶχ" — | e’nx"-1 dx (53) 
and the integral on the right is the same integral as the original of 


equation (52) with n — I in place of nm. Hence it is written 7,-, and 
integral (53) becomes 


I, = εἶχ" — ἡ]... (54) 


This is the reduction formula for the given integral which may be used 
for any value of n > 0. For example, if n = 3, Jz = e*x? — 31), Ig is 
evaluated by writing n = 2 in formula (54) as I, = e*x? — 2/,, and 
similarly J, = e*x — 10, [0 = θ᾽ so that finally 

Tz = x3e* — 3x*e* + ὄχεξ -- 6e7 + C 
where C is an arbitrary constant. 


Simple results are often achieved if the integrals to be reduced are 
definite integrals. 


Example Obtain a reduction formula for 


ἐπ : 
= [ sin" x dx, n>0. 
ο 


118 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


Solution The function sin* x may be written as the product of sin*-! x 
and sin x so that integrating sin x and differentiating sin*-1 x and 
integrating by parts, 


ἐπ ἐπ 
i= | -c0s x sin™"+ x| - [ (—cos x)(n — 1) βίηπ 2 x cos x dx 
0 0 
or la 
7, = (κα -- 1) [ sin"? x cos? x dx. (55) 
«Ὁ 
Now in 
1 ΞὰΞΞ ( sin"? x dx. 
Ὁ 


Hence it is necessary to transform the integral in equation (55) using 
the identity cos? x + sin? x = 1, so that 


ἐπ 
L=(n- »| sin®-? x(1 — sin? x) dx, 
0 
or L=(n—-—Dh-2—-(n— Dh. 
Solving for J, gives 
nl, = (n - 1)... 


= 
or jee —— Ins (56) 


which is the required reduction formula. 

When n is a positive integer the formula may be applied repeatedly 
until the remaining integral is either J,, when n is odd, or J, when n 
is even, where 


in ἐπ 
i= { snxdx=1 and h= { dx = ἐπ. 
0 Ὁ 


Writing n — 2 for n in formula (56) gives 
n—3 


In-2 an 


nmin ® 


and writing n — 4 for n in formula (56) gives 


and so on, so that finally, 
n—in—3n—-5S 
no π--δη-- 4) 
(n — 1) --- 3)m -- 5)... .4.2 
n(n --- 2)0ι -- 4... .5.3 


1. ΞΞ 


. 9 or ἢ. 


When nisodd, [|,,.3 


> 
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6.9 FURTHER SUBSTITUTIONS 


Rule (c) of section 6.2.1 is a substitution rule, but more generally let 
F'(x) = f(x), then 


[FON = fbOO}8') 
and [gee de = [πφολάμο = [Λα (57) 


where u = ¢(x) and after integration u is replaced by ¢(x). For example, 


i {$(x)}"6'(x) dx = [ udu where u = d(x), 
u™ +1 


=n+1 


= es πα τι 


Sometimes it is necessary to write x = g(u) say instead of u = (x) in 

the given integral. Then the function g(u) is the inverse of d(x) and 

hence must be chosen to be monotonic in the range of integration. 
Consider the definite integral 


[τ dx = F(b) — F(a). 


Suppose that the substitution x = g(u) is made such that as x varies 
monotonically from a to δ, u varies monotonically from « to β where 


a = g(a), ὃ = g(), then 


[ ” flo) dx = F{g(8)} — Fle(a)}. 


Some general substitutions follow, where a and ὃ are given constants: 


if \/(ax + 8) occurs, substitute ax + ὃ = u?; (58) 
if »/(a? — x?) occurs, substitute x = asin 8, —$n <0 < ἐπ; (59) 
if a? + x? occurs, substitute x = a tan 0, --ᾧπ < 0 < ἔπ; (60) 
if 4/(x? + a?) occurs, substitute x = a tan 9, —47 < 0 < ἐπ (61) 
or x = asinhu; (62) 


if 4/(x? — a?) occurs, substitute x = acoshu, u> 0. (63) 
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Example (i) Evaluate 


1 x 
I= so 
[ {νὰ + χ»" 
Solution Substitute x = tan 0, --π < 6 < 4a, then dx = sec? 6 d@: 


when x = 0, 6 = 0; when x = 1, 0 = ζπ, and {+/(1 + x*)}® = sec? 6. 
Then 


[΄ sec? θ 40 
r= | eth =|" cos 5969 = -ἰς. 
Example (ii) Evaluate 
I= x? dx 
τυ VOFED 


Solution The substitution x = tan 6 transforms the integral into 


[rane 6 sec 6 d@ which is no simpler than the original. Hence substitute 


xX = sinh u, dx = cosh u du and 
f= | sinh? u du 


= + | (cosh 2u — 1) du 


= } sinh 2u — ἐμ 
= }sinhucoshu — ἐμ 
= ἐχν (3 + 1) — F sinh“! x 


6.9.1 Integration using t = tan 3x 

Rational functions of sin x and cos x may be reseed to rational 
functions of ¢ by the substitution ¢ = tan $x, --- ἀπ < 4x < }o. In 
particular this substitution is of use in integrating Functions containing 


expressions a cos x + ὦ sin x, where a and ὁ are constants. The method 
uses the following results: 


1-7, 2t ; 
cosx= 1 Ἐπ: sinx = rarer and 4sec? 4x dx = dt 
so that dx = 2dt/(1 + 12. 


The substitution is cumbersome and is used only when the simpler 
methods of earlier sections have failed. 


ἐπ dx 
Example Evaluate ᾧ sinx + cosx +1. 


Solution Substitute t = tan 4x and use the results above and the 
integral becomes 

1 at ak 
᾿ era [tog d+ 2] = log 2. 


INTEGRATION 121 


6.10 INFINITE INTEGRALS 


Infinite integrals are those in which the limits of integration or the 
integrand or both become infinite. In section 6.3 the integral of f(x) 
was defined in the range a < x < ὃ, on the assumptions that f(x) is 
defined at each point in the range and that a and ὃ are finite. This 
definition is now extended to the case where a or b or both become 
infinite, and to the case where /(x) is undefined for a finite number of 
points in the range of integration. 


6.10.1 The limits of integration become infinite 
Let Γ΄ ΟΣ = f(x) for all values of x > a, where a is a finite real number, 


then [ ) dx is defined by 


[ ᾿ fx) dx = Lim [ : fx) dx = Lim F(X) -- F(a) (64) 
a X>o Ja X> 0 


when this limit exists. 
ὃ 
Similarly ] F(x) dx is defined as 


Lim { ᾿ fs) dx = F(b) -- Lim F(X) (65) 
X—+—-—0o JX X+>- 0 


when the limit exists. 
If the integral as a limit exists the integral is said to be convergent. 
Otherwise it is divergent. 
: Ὁ dx 
Example (i) Evaluate [ τ 
1 


; , Χ dx 
Solution Consider [ = log X — log 1 


1 
= log ¥ > οὦ as X > o. 


Hence the integral is divergent. 


οο 


Example (ii) Evaluate { ε΄" dx, k a positive constant. 
Ὁ 
Solution Using (64), 


00 x 
[ ο΄" dx = Lim ο΄ * dx 
oO 


Χορ JO 


1 
= Lim; (1 — e7**) 
χνοκ 


ae 
=; 
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6.10.2 Substitution in infinite integrals 


It is possible to transform an integral with infinite limits into one with 
finite limits by a substitution, as in the following example. 


ao 


dx 
Example Evaluate [ is xt 


Xx 
Solution Consider [ τττ τ and make the substitution x = tan θ so 


dx 
o l+x 
that the integral becomes 
tan71X 
[ d§6 =tan-! ΧΟ which -- ἐπ as Χ --» oo. 
0 


Hence it is possible to evaluate the integral directly as 


oo ἐπ 
[ raf dO = 437. 


6.10.3 The integrand becomes infinite 


Let the integrand f(x) become infinite at the end points of the interval 
a<x <b. Strictly the integral does not exist since the integrand does 
not exist at these points. But a meaning may be attached to the integral, 
called an improper integral. 

Let Γ΄ ΟἹ) = f(x) for a< x <b. If both F(a + 0) --» finite limit 


b 
and F(b — 0) — a finite limit then { J(x) dx is defined to be 
a 
Fb — as F(a + 0). (66) 
Example (i) Evaluate [ νς- 
Solution The integrand becomes infinite when x = 2. Consider 


ΠΕ eG) 
= sin-? (1 — 46). 


As ὃ --» Ὁ, sin~1(1 — $6)» 4m and hence this is the value of the 
integral. 


1 
Example (ii) Evaluate [ log x dx. 


Solution The integrand becomes infinite (and negative) at x = 0. 


Consider 
1 1 
[ log x dx = [χορ x - x| 
δ é 


Ξε ὃ --Ἰ -- δΊορ ὃ. 
Now — 6 log ὃ = (log X)/X where X¥ = 1/6 and 
Lim (— 6 log 6) = Lim {(log X)/X} = 0. 
6—0 X> 0 
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1 
Hence [ log x dx = —1. 
Ὁ 


The definition (66) may be extended to define integration of functions 
which are undefined at a finite number of points within the range. For 
if f(x) becomes infinite at x = c, a < ¢ < b, then using (42) of section 
6.3.1 the integral may be written 


[ f@ ax = [τω dx + [τ dx. 


If F(c — 0) and F(c + 0) both approach finite limits then the integrals 
on the right may be evaluated using (66). 


6.11 OTHER METHODS OF INTEGRATION 

The methods of integration of elementary functions given in this chapter 
are not exhaustive. There are many elementary functions which are not 
exactly integrable, such as cos(x?), 1/4/(1 — x‘), (sin x)/x, exp (—~x?), 
and other methods of integration do exist to deal with these. Numerical 
methods of integration are discussed in section 14.8 of chapter 14. The 
integrals of some important functions are tabulated. Further methods 
are beyond the scope of this book. 


EXERCISE 6 


In the following examples a and ὁ are given constants. Find the in- 
definite integrals of the functions 1-42: 


1 3a?x§, 2 (3x? + 4x + 2), 

3 (ax? + δ), 4 \/(2ax), 

5 v(e*), 6 [V2 + x?) — V2 — x*)I/V/(4 — x4), 
7 coth? x, 8 27, 

9 (5 — 4x)-4, 
10 α[(α — x), 11 (3x + 2)/Bx + 1), 


12 (x? + I/& — 1), 

14 sinh x/(1 + cosh x), 

16 sec x cosec x, 

18 (1 — sin x)/(x + cos x), 


13 1/1 + 9x), 
15 χίν (3 + 1), 
17 (sin x — cos x)/(sin x + cos x), 


19 cos? x, 

21 sin? x cos? x, 

23 cos® x cosec’ x, 

25 cosec* x, 

27 sin 3x cos 5x, 

29 (2x — 1)/(x? + 4x), 

31 (2x — ᾿)ν 3 + 4x + 8), 
33 (x — 5)/V/(4x — x*), 
35 e7/(1 + 632, 

37 1/x(1 — x?), 

39 1/x(x + 1), 

41 (x + 1)/(x? + 4x — 5), 


20 sin’ x, 

22 sin® 4x cos® 4x, 

24 sin? x cos* x, 

26 tan? 5x, 

28 (2x — 1)/(x? + 4x 4+ 4), 
30 (2x — 1)/(x? + 4x 4+ 8), 
32 1,.ν — x — x?), 

34 1/V{x(1 — x}, 

36 1/(x? + 4x + 7), 

38 (Sx — 3), + I — 3), 
40 1/1 — x*), 

42 (χ' — x® — x — 1)/(x3 — x). 
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Evaluate the integrals 43-49: 


43 [ tan x dx, 44 i sin® x cos x dx, 
= [<< x - ih SE as, 

-1 
0 | aay 5, ἢ 
49 [ ae. 


Find the indefinite integrals of the functions 50-54 by integrating by 
parts: 


50 sin=! x, 51 x? tan} x, 
52 xe7/4, 53 x sin x, 
54 x"logx, n#—1. 


Evaluate the integrals 55-58: 


2 ἐπ 
55 [ χ 2 log (1 + x) dx, 56 [ x cos x dx, 
1 Ὁ 


57 [- sin x dx, 58 ie ο΄ 25 cos 3x dx. 
59 Find a reduction formula for J, in terms of J,_, given that 
= fex dx. 
Hence find the value of 15. 
60 Prove that if 7, = [tan x dx, then 
IT, + In-a = (tan"-* x(n — 1), n #1, 
and hence evaluate { x tan® x dx. 


π 
61 Given that J, = [ οὗ sin” x dx prove that, forn > 1, 
0 


(n? + ΤῊ, = πίη — 1)J,-2. 
Hence or otherwise evaluate J, and 75. 


ἐπ 


62 Show that, if 7, = [ x cos" x dx then 


ο 


= 


and evaluate J, and J. 
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63 Find the value of i cos 5x cos nx dx (i) when n = 5, (ii) when 
n#5, ᾿ 

Find the indefinite integrals of the functions 64-68 using the substitu- 

tion given: 

64 1{χν («3 — 2)}, (x = 1/2), 65 1/(e* + 1), (x = —log u), 

66 x(3x? — 2)9, 3x? —2 = 2), 67 χίνα + 1), (νὰ + 1) = 9), 

68 cos x/+/(1 + sin? x), (u = sin x). 


Examine for convergence, and in cases where the integral converges 
evaluate, the definite integrals 69-76: 


0 . ~ dx 
69 [- e* sin x dx, 70 [ G+ ay” 
> log x [ Ἂ ἐχ 
71 i > dx, 72 MG ANGE SE 3’ 
FT dx * dx 
3 | — 74 TT a ny’ 
Ϊ νᾷ -- x) Ϊ V(1 + x?) 


3 dx 1 1+x 
s CaF v6 [ {{π23} 
By writing ¢ = tan ἐχ find the indefinite integrals of the functions 


71-19: 


77 1/3 + 5cos x), 78 sin x/(1 — sin x), 
79 (1 + tan x)/(1 — tan x). 


80 Evaluate 
tr ax 
lo 2+sinx 


7 
Applications of Integration 


7.0 INTRODUCTION 


In this chapter the problem of integration will be explained briefly 
without rigour, with reference to geometrical and physical applications. 


7.1 INTEGRATION AS THE LIMIT OF A SUM 
7.1.1 Area as a limit 


The problem is to consider the area A included between a continuous 
curve y = f(x), the x-axis between x = a and x = 5, and the ordinates 
x =aand x = bas in Fig. 7.1. 


' 
t 
t 
' 
' 
' 
! 
! 
4 i 
' 
! 
! 
4 
1 
{ 
1 
t 


f(b) - fa) 


a ath, ath, 
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The method is to construct a set of rectangles which lie underneath the 
curve whose total area is S, and another set which lie above it whose 
total area is S,, and investigate as to whether they can be made to 
approach each other. 

Let y = f(x) be continuous, increasing, and positive ina<x<b 
and let it be represented by the curve sketched in Fig. 7.1. Divide the 
range ὁ — a into a series of subdivisions with abscissae a, a + hy, 
ath, ...,a+h, = ὃ and complete the rectangles on bases Ay, 
he — hy, hg — hg, . . «5 hy — hya-1. The areas S, and Sy, are given by 


δὲ = f@h + fla + hilhe — Δ) + fla + πολ ᾳ — ha) +... 
+ fla + Ay-1)(An = hy-1) 
= fla + hyyh, + fla + hehe — hy) + fla + has — ha) +. . 
+ fla + Aa)An a hy-1) 
then 5: <A< 52. 


As the number of subdivisions is increased, and the largest of the 
intervals 1, — h,_, is made ‘as small as we please’, then the difference 
between S, and S, approaches zero, i.e. S, and S, tend to a common 
limit A. If, for all possible values of S, and S, A is a fixed number, A is 
defined as the required area. The difference δὲ — S, is given by 


Sz — δὲ = (f(a + hy) — f@jh, + (fa + ha) — fla + My) he — In) 
Ἔ... Τα t+ An) — λα + Ain-1)}ltn — An-1)- 


Let h be the greatest of the numbers A,, (hz — hy), (hg — he) 
(h, — h,-1). Then since f(x) is increasing, the numbers { f(a + Five  #(a)} 
{f(a + hz) — f(a + hy)} ete. are all positive or zero. Hence 


8.) -- Si < ἐπε: + ae — fl@jh + {fla + he) — fla + hy)jh 
- + {fla + An) — fla + hy-s)jh 


= a + hy) — flajh 
= {f®) — flajr. 


Thus, given a number ¢ say, however small, it is possible to choose ἡ 
such that 


c 
< —_—_—_—_—____. 
fe) — fla) 
then So = ΝᾺ < ζ, 


i.e. by taking the subdivisions sufficiently small, the difference 82 — 51 
can be made ‘as small as we please’, and the area A is a definite number. 
Let x, be a value of x in the Ath sub-interval. Then if 


> fl NMhy — bya) > 1 
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as the largest sub-interval (h, — h,-1) - 0 where / is fixed and finite, 
then f(x) is said to have a definite integral defined as 


z=b 
Lim > Αχρ 6x,, where A, — hy, = 6x; (1) 


6z>0 r=a 


Note that Fig. 7.1 shows the case of n equal subdivisions of (b — a). 


7.1.2 Area as integral 
Suppose now that F’(x) = f(x) also. Then, using the mean value 
theorem, 

F(a + hy) — F@ + hy-1) = Oe — ἀκ. DX) (2) 
since x, is a value of x in the kth sub-interval. Summing over all values 
of k, 


FO) -- FO) = (ln — ἵν. ὑγρου. Q) 


If the sum on the right of equation (3) approaches a limit as the 
largest of the intervals /,, — h,-, = 6x, approaches zero, then that 
limit is the same as in (1). When this limit exists it is the definite integral 
of the function f(x) between the limits x = a and x = 5, i.e. 


Ξε ὃ ὃ 
Lim "fix x = area = {Λα ἀν (4) 
zr=a a 
the idea of area being omitted as necessary. 


7.1.3 Functions not necessarily continuous increasing and positive 

In section 7.1.1, y = f(x) was chosen to be continuous, increasing, and 
positive throughout the whole range a < x < ὃ. Similar arrangements 
to those of sections 7.1.1 and 7.1.2 hold if the function decreases 
throughout the range, or if it is negative anywhere in the range. A 
function which decreases and increases through the range may be divided 
into intervals in which the function either decreases or increases, and 
the total area defined as the sum of the separate areas. 

A function which becomes negative in the range may be divided into 
intervals in which the function is either positive or negative, and the 
total area defined as the sum of the separate areas. Since the integral 
is negative when the function is negative the area here is the modulus 
of the integral (see section 6.3.2). 

Moreover, the argument applies to all increasing (or decreasing) 
functions, not only to those which are continuous. For example, it is 
possible to integrate the following function which is increasing but not 
continuous: 

f@=4 in 0<x<}, 

fe)=F in e<x<i, 

fe)=F in F< x<h, 
etc. 

f@® =1 when x=1. 
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See Fig. 7.2 for a sketch of y = f(x). 
This function has an infinite number of discontinuities; nevertheless the 
definition of integral as the limit sum applies. 


Fig. 7.2 


7.1.4 Rules for definite integrals 


ὃ α 
a) [seoae = — [peo ax 
since F(b) — F@ = —{F@ — F(b)}} 
ὃ ο ὃ 
(2) [ S() ax = [ T(x) dx + [ Τὺ) dx 


where c is some value of x between a and ὁ. 


7.2 APPLICATIONS OF INTEGRATION 
7.2.1 Plane areas 


The area enclosed by the curve y = f(x), the x-axis from x = a to 
x = δ and the ordinates at x = a, x = b is, from (4) of section 7.1.2 


Lim Sfx) 6x = [ ” fee) dx (5) 


6z-+0 r=a 


If x = x(t), y = y(t), « <¢ < B where ¢ is a parameter, 


8B 
area = [ y(t) a dt (6) 


Suppose y, = f(x) and y2 = f2(x) are two functions of x which are 
continuous for a <x <b and suppose f,(x) <f,(x) fora<x <b. 
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Then the area bounded by the two curves and the ordinates at x = a, 
x = bis given by 
z=b b 
Lim (h(a) — fied} x= [AO - πολ. Ὁ 
z-O0 r=a a 


See Fig. 7.3. 


In polar coordinates the equation of a curve is r = f(@). Let Po be 
the point (ro, 9) and P, be the point (rp + dr, 4 + 68), see Fig. 7.4. 


Initial line 


Fig. 7.4 


Then the elementary area swept out by the radius OP lies between 
4r2 66 and 3(ro + dr)? δθ, i.e. 


oA 
μὲ «Ὁ < Moo + δι), 
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and thus, in the limit dA/d9 = }r?. Summing elementary areas, the area 
swept out by the radius OP as θ increases from « to β is 


8: β 8 
Lim ¥ 1.280 = [ dr? dO (8) 
66-0 6=a α 


In terms of the corresponding Cartesian coordinates, x = r cos 0, 
y =rsin 6, tan θ = y/x and sec? 6 dd = (x dy — y dx)/x. Also since 
r = x sec 0, r? 40 = x dy — y dx and the area may be written 


9 
Pica — yay 9) 
which is the area swept out by the radius as P moves from P to Q 


along the curve. 


Example (i) Find the area bounded by the parabola y = 2 — x? and 
the straight line y = x. 


Solution The required area is sketched in Fig. 7.5. 


Fig. 7.5 


The curves intersect when x = 1 and x = —2, and for all values of x 
between —2 and 1 the parabola is above the line by an amount γ; — ya 
where y, = 2 — x? and y. = x. Hence the total area A between the 
curves is given by 


1 
A=Lim > οἱ — γὴ δα = [ (2 — x? — x) dx = 44. 
ὅτ» —-2 
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Example (ii) Find the area between the x-axis and the cycloid 
x = a(t — sin’), y = a(1 — cos t),0 <t <2z, a sketch of which is 
given in Fig. 7.6. 


Cycloid 2 = a(t-sint), y=a(1-cost} 


O<t<2r 


ο απ 2ar =x 


Fig. 7.6 

Solution Using (6) the required area A is given by 

2π 

A= { a(1 — cos t)a(1 — cos t) dt = 3na?. 

Ὁ 
Example (iii) Find the area A enclosed by the cardioid r = a(1 + cos 6), 
—7<0<7. 
Solution From (8), 


A= [΄ 363 + cos θ)3 dO = 37a? 


7.2.2 Volumes 

The volumes of many solids can be found by summing elementary 
volumes whose cross-sectional area is known, and taking the limit of 
this sum as the thickness of the element tends to zero. Suppose for 
example, that the solid is bounded by two parallel planes perpendicular 
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to the x-axis at x = a and x = b. Let dV be the volume of a repre- 
sentative elementary volume, or ‘slice’, whose area of cross-section is 
A(x) and whose thickness is 6x, as sketched in Fig. 7.7. 

Then the required volume is the limit of the sum of such elementary 
volumes as the thickness dx —> 0. The total volume V is given by 


ὃ 
V = Lim Σ 40) ὃχ = [ A(x) dx. (10) 
6z—>0 a 

In particular, when the solid is a solid of revolution formed by 
rotating a plane area about the x-axis or y-axis, a simple result may be 
obtained. Let the plane area be that bounded by the arc of the curve 
y = f(x), the x-axis and the ordinates x = a, x = ὃ, and suppose that 
this area is rotated about the x-axis to form a solid of revolution. 


Fig. 7.8 


The representative element of volume is then a disc of radius y = f(x), 
and hence A(x) = zry?, (see Fig. 7.8). Then from (10) the volume of the 
solid V is given by 


z= 


=b ὃ ὃ 
V=Lim > mye dx = [τ 2dx or [τῶ a. (11) 


6x0 t=a 


Example (i) Find the volume of the ellipsoid formed by rotating the 
area enclosed by the ellipse x?/a? + y?/b? = 1, y > 0 about the x-axis. 
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Solution y? = δ ( -- x?/a?) —a<x<a 

Volume = 2 [a — x?/a*) dx = 4rab?/3. 
Writing b = a gives the ccintie of a sphere of radius a as 4709/3. 


Example (ii) Find the volume of the ring generated when the circle 
x? + (y — b)? = a*, a < δ, is rotated about the x-axis. 


Solution See Fig. 7.9(a). For each value of x between —a and a there 
are two values of y, say y, and y2 where y, = b + +/(a? — x?) and 
y2=b -- y(a? — x), 


y 


Fig. 7.9 


The area of the cross-section of the ring by a plane perpendicular to 
the x-axis at the point x is 


my? — myz = my + ya — ya) = 2Ζπδ2ν (α3 — x?). 
See Fig. 7.9(b). Hence the required volume is 


4nb./(a? — x*) dx = 2n?ab 


(substitute x = a sin @ to evaluate the integral). 

It is of interest to note that the volume is the product of za?, the 
area of the generating circle, and 27b which is the length of the path 
of the centre of that circle as it rotates through one revolution about 
Ox. This is investigated further in section 7.2.6. 


7.2.3 Centre of mass, centroid 


Let masses "11, mz, m3, . . ., m, be concentrated at points P,, Po, Ps, 
.. . P, at distances from a fixed plane x = 0 of x1, Xo, Χρ, - - ., Xn 
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Their moment about the plane is defined to be 
n 
xXymMy, + XgMNg + X3M3 + oe + x,N, = > XIN ks (12) 
K=1 


known sometimes as the first moment of the system about the plane 
x = 0. 


n 
If the total mass Σ᾽ m, = M say, then the centre of mass G of the 


k=1 
system is the point at which a concentrated mass M would be 
situated in order to have the same moment about x = 0. Let G have 
coordinate X, then the position of G is given by 


n n 
ΧΜ or ΧΣ m= Σ᾽ xm. (13) 
κει k=1 
If the masses are located at points whose coordinates are (x1, }.» 21), 
(X2, Vas Z2)>+ + +» (Xns Yas Zn) With reference to fixed axes, it follows that 


G has coordinates (, §, Z) given by 
n n n 
ME = > xem, MY = Σ yim, MZ= > zm, (14) 
k=1 k=1 k=1 


The centroid of the points P,, Pz, Ps, . . ., P, is the geometric point 
defined by equations (14) when the masses m,, are taken equal so that 
the centroid or mean centre of the system of points is the point whose 
coordinates are 
n 1 2 1 2 
Σ Xm FR DL Wo ΖΞ" Σ % (15) 


nK=1 ἡ κξι π κει 
In order to find the centre of mass of most physical objects it is 
necessary to make certain simplifying assumptions. One is that matter 
in a given solid is continuously distributed throughout the solid. 
Another is that if P is a point in the solid and dV is an element of 
volume containing P whose mass is 6m, then the ratio ém/éV tends to 
a definite limit called the density of the solid at P, and denoted by p(P). 


7.2.4 Centre of mass of a plane lamina 


Consider the plane lamina formed by the area A enclosed between the 
curve y = f(x), the x-axis from x =a tox = b and the ordinates 
x =a, x = b, see Fig. 7.10. 

Let the coordinates of the centre of mass of this area be (x, ἢ). The 
elementary area is y 6x so that the elementary mass dm = py 6x and 
its centre of mass is approximately the point G (x, 3y). Taking moments 
about the x- and y-axes, 


XD ydx p = Dxy dx p 
PS ydx p = Thy" xp. (16) 
If p is constant, mass is proportional to area and, in the limit (16) is 
equivalent to 


ὃ 
Az = [ yx dx 
a 
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ὃ 
»»- [ν᾽ ἃ a7) 


1.6. (ἃ, f) are the coordinates of the centroid of the area. 


Example (i) Find the coordinates of the centroid of a plane semi- 
circular lamina of radius a (Fig. 7.11). 


Fig. 7.11 


Solution This is the area under the curve 
y= VV@ — x), -—a<><a 


Total area A = ἐπα. 
Let the coordinates of the centroid G be (i, ,), then x = 0 from 
symmetry, and 


a 
Aj = hy? dx 


πα 


Ξε [- h(a? — x?) dx 
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= [ω — x*) dx = ξα3, 
0 


. 4a 
then = 5 
Example (ii) Find the coordinates of the centroid of the area contained 
between the x-axis and the cycloid x = a(t — sin 1), y = a(1 — cos), 
0<¢<2r. 


Solution From example (ii) of section 7.2.1 the area A enclosed is 3za?. 
Let the centroid G have coordinates (Χ, ¥), then from symmetry Ζ = a, 
or alternatively X may be found from 


2a 
AX = [ a(t — sin t)a(1 — cos t)a(1 — cos ἢ) dt 
Ὁ 
since dx = α(Ϊ — cost) dt. 


2a 
Also, Aj= [ 4a*(1 — cos t)?a(1 — cos ὦ) dt 


«Ὁ 


2a 
12° | (ὁ — 4cos t + 3 cos 21 + sin? ¢ cos 1) dt 
0 


2π 


ll 


4a8 [a —4sint + }sin 21 + 4 sin® ἢ 


Sra’. 
Hence jp = $a and G is (a, $a). 


0 


7.2.5 The centroid of the volume of a solid of revolution 
Let the solid be generated by rotating the plane area under the curve 


y = f(x) from x = a to x = b about the x-axis. Using (11) of section 
7.2.2 the total volume V is given by 


ὃ 
V= [ my? dx. 
a 


The elementary volume is zy?dx and its centroid is at (x, 0) approxi- 
mately. Hence ¥ = 0 and taking moments about the y-axis, summing 
and proceeding to the limits, 


ὃ 
VE τ i my?x dx. (18) 
a 


Example (i) Find the coordinates of the centroid of a solid hemisphere 
with radius a. 


Solution Let the hemisphere be formed by rotating that part of the area 
of the circle bounded by x? + y? = a? for which x > 0, y > 0 about 
the x-axis (see Fig. 7.12). 

Let the coordinates of the centroid G be (%, ἢ), then =O and 
V = ξπαϑ. From equation (18), 


a 
VE = [ my?x dx 
Ὁ 
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a 

= «ἰ x(a? — x?) dx 
Ὁ 

= ἐπα 


and ἃ = 3a. Hence, G is at (Za, 0). 


ν΄, 
2 


ASS AAAS AARNE SSS 


-a 


Fig. 7.12 


Fig. 7.13 


Example (ii) Find the coordinates of the centre of mass of a right 
circular cone of height ἢ the density of which varies as the square of 


the distance along the axis from the vertex. 


APPLICATIONS OF INTEGRATION 139 


Solution Choose the vertex of the cone as the origin of the coordinates 
and the axis of the cone as the x-axis. Let the cone be generated by 
rotating about the x-axis the area enclosed by y = mx, 0 <x <h the 
x-axis between x = 0 and x =A and the ordinate x =h. Let the 
density be kx? where k is a constant. Let the coordinates of the centre 
of mass G be (%, ¥) then p = 0 by symmetry, see Fig. 7.13. 


Total mass = limit of }zry?6dx kx? 
h 
=kr [ m?x* dx 
Ὁ 
= tknm?h’. 
Taking moments about Oy, 
h 
tham hx = [ (my? dx) (kx*)x 
hk 
= kr { m?x® dx 
Ὁ 


= tknmh® 
and Χ = $h and G is (8h, 0). 


The reader is left to check that if the density is constant, so that mass 
is proportional to volume, then G is the point (3A, 0). 


ο 
» 
ΕΞ 
Ὁ 
8 


Fig. 7.14 


7.2.6 Pappus’ theorem 

When a plane area A, as in Fig. 7.14, is rotated about an axis in its 
plane which does not intersect the area, there is a relation between the 
centroid of the area and the volume of the solid of revolution formed 
by rotating the area about the axis. This relation is a formula in the 
form of the theorem below, of use in calculating the volume when the 
centroid is known, or in calculating the centroid when the volume is 
known. 
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For, let the axis of revolution be the x-axis, and 6A an element of area 
distant y from the x-axis. If V is the volume obtained by rotating A 
about Ox, 6V the corresponding element of volume obtained by 
rotating 6A about Ox, and jf the coordinate of the centroid of A, then 


OV = 2πγ OA. 
Hence V = Lim >) 2zy 6A 
= 20 Lim Dy 6A 
= 2π (moment of area about Ox) 
= 2n(Aj) 
or V = 2nfA, (19) 


and Pappus’ theorem states that if a plane closed area is completely 
rotated about an axis in its own plane which does not intersect the area, 
the volume of the solid formed is equal to the product of the area and 
the length of the path traced out by the centroid of this area. Note 
that the axis may touch the area. 


Example Find the centroid of a plane semicircular lamina of radius a. 
Solution The semicircular lamina, as in Fig. 7.11, is chosen to be that 


bounded by the curve x? + y? = a’, y > Oand the x-axis, —a <x <a. 
Hence A = ἐπα, V = 4πα3 3, 


and y= THA 


Compare this method with that of example (i) of section 7.2.4. 


7.2.7 Moments of inertia 

Let masses 7m, m2, M3, . . ., m, be concentrated at points P;, Po, Ps, 
. . ., P, whose distances from a fixed line / are rj, ra, 7s, . « .» fn» Then 
the second moment or moment of inertia of the system about / is defined 
as 


n 
I, = my? + mor? + mar?+. 2. + mr? = Y mr? (20) 
{Ξι 


Let M be the total mass of the system, then the equation 
Mk? = I, (21) 


defines k,, the radius of gyration of the system about ἰ. 

If instead of a system of discrete masses there is a continuous distri- 
bution of mass, the assumption is made that the mass may be divided 
into small elements of mass dm such that, if r represents the distance 
of one point of the element dm from an axis, then all points of that 
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element will be within distances r + ¢ of the axis where c — 0 as the 
largest value of 6m— 0. Then for each element of mass, its moment 
of inertia lies between (r — c)?dm and (r + c)?dm and hence differs 
from r?ém by a quantity which is ‘as small as we please’ as dm — 0, i.e. 


Lim >r?6m = Jr dm = I. (22) 
é6m>0 
Example (i) Find the moment of inertia of a thin uniform rod of mass 


M and length / about an axis which is the perpendicular bisector of 
the rod. 


Solution Choose the origin as the mid-point of the rod and the x-axis 
along the rod. Then (22) gives 


i 
r= a ὡς 
«4 
_ MP 
=" 


Fig. 7.15 


Example (ii) Find the moment of inertia of a uniform disc of mass M, 


radius a about an axis through its centre perpendicular to the plane of 
the disc. 


Solution Consider the disc split into concentric circles so that at 


distance r from the centre the width of an element of area is dr, see 
Fig. 7.15, 
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Then 6m = 2nr δυ( Μ|παΞ) = (2M/a?)r or and 
22M 
r= —z 7° dr = 4Ma’. 
. ὦ 
Example (iii) Find the moment of inertia of a solid homogeneous 


sphere of total mass M and radius a about a diameter. 


Solution Choose axes Ox, Oy, Oz through the centre of the sphere. 
Let the sphere be cut into elementary discs by planes perpendicular to 
the z-axis at intervals 6z as in Fig. 7.16. 


ἘΠ 


LMM 82 


Fig. 7.16 


The disc between z and z + 6z is approximately a cylinder of radius 


/(a? — 232), thickness 6z and mass 6m = πίαξ — z?)6z, and its 


3M 4πα3 
moment of inertia about ΟΖ is ia (a? — z*)dz (a? — Ζῇ), (applying 


the result of example (ii) above). Summing the moments of inertia for 
the separate discs and letting 6z — 0 


= [35 (2 _ 2m 
lo= [δα — 2°)? dz 
= Ma’. 


The moment of inertia of a rigid body about a line is important when 
the rotation of the body is being considered. Moments of inertia of only 
the simplest bodies can be found by integration. There are two theorems 
which may be used to calculate moments of inertia. 
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The parallel axis theorem relates the moment of inertia of a body 
about an axis through its centre of mass G to the moment of inertia 
about a parallel axis / distant d from G (see Fig. 7.17). 

Let Ig be the moment of inertia of a body of mass M about an axis Gz 
through the centre of mass G, and let J, be the moment of inertia about 


P(x,,y,12;) 


a line / parallel to Gz so that / is distant d from Gz. Let m, at P(x,, y;, Z1) 
be an element of mass of the body. Then, from formula (20), 


I, = Lim Σ m{y? + dd — x,)} 
= Lim [> m,Q2 + y?) — 2d m,x, + d?Qm,], 
i.e. 1, ΞΞ Ig + d?M. (23) 


The term >m,x, is zero since G is the centre of mass. 

For example, using the result of example (i) of section 7.2.7 and the 
parallel axis theorem, the moment of inertia of a thin uniform rod of 
length / about an axis through one end perpendicular to the rod is 
MIP/12 + (42)?M which is 4M’. 
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A second theorem, for plane laminas only, which is sometimes called 
the perpendicular axis theorem, states that the moment of inertia of a 
plane lamina about an axis perpendicular to its plane is equal to the 
sum of the moments of inertia about any two perpendicular axes in 
the plane which intersect on the first axis. For suppose the plane of the 
lamina is chosen as the xy-plane. Two perpendicular axes in the plane 
are Ox and Oy which intersect on Oz. If 1,, Jz, and J, are the moments 
of inertia about Ox, Oy, and Oz and m, is an element of mass at 
(x,, »ρ.» 0), then 

1. = Lim }m,(x? + y?) 


= Lim [Sm,x?2 + ¥m,y?] 
or 13 = T, + ἢ. (24) 


For example, the moment of inertia of a thin disc of radius a about 
a diameter may be found by equation (24). Choose axes Ox, Oy, and 
Oz through the centre of the disc such that Ox and Oy lie in the plane 
of the disc and Oz is perpendicular to the plane. From symmetry, 
I, = I, and from example(ii)abovel, = 4Ma?. Hence J, = I, = }Ma?, 
which is the required moment of inertia. 

Moments are also of importance in statistics. The first moment, 
defined by (12) of section 7.2.3, is used in computing the mean, and the 
second moment, defined by (20) of section 7.2.7, is used in computing 
the variance of a given set of data. These will be met with again in 
Chapter 15. 


EXERCISE 7 


1 Calculate the area bounded by the two parabolas y = x?, y = }x? 
and the line y = 2x. 

2 Find the total area contained in the astroid x = acos* t, ν ξξ sin*z, 
0<t<2r. 

3 Find the area bounded by the cardioid x = a(2 cos t — cos 21), 
y =a(2sint — sin 21),0 <t < 25. 

4 Find the total area inside the circle r = 2a sin 0. 

5 Find the volume of the solid generated by rotating about the x-axis 
that part of the parabola y = 2x — x? which lies above the x-axis. 

6 Prove that the volume of a spherical cap of height A cut from a 
sphere of radius a is 4ah?(3a — ἢ). 

7 The centres of two spheres of radius 3 cm and 4 cm respectively are 
at a distance 5 cm apart. Prove that the volume common to both 
spheres is 927/15 cm’. 

8 Sketch the curve y = x cos x for 0 < x < ἐπ, and find the volume 
generated when the area contained between this curve and the 
x-axis is rotated about the x-axis. 

9 Find the area of the region enclosed by the curves x? = ay, ay? = χϑ 
and find also the volume generated when this region is rotated about 
the x-axis. 
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10 Show that the curve x(x? + y?) = y? lies entirely in the region 
0 <x <1, and make a rough sketch of the curve. Find the area 
of the region in the first quadrant lying between the curve and the 
line x = 1 and find the x coordinate of the centroid of this region. 
(Hint: Use the substitution x = sin? @ to evaluate the integral.) 

11 Find the area bounded by one loop of the curve y? = χά — x?) 
and the coordinates of the centroid of this area. Hence, or otherwise 
find the volume generated when this area is rotated about the 
y-axis. 


2 
12 Use the substitution x = 2sin?6 to evaluate { x"(2 — x)tdx 


ο 
where n is a positive integer. Find the area enclosed by the loop of 
the curve y? = x4(2 — x) and the coordinates of the centroid of 
that part of this area which lies in the first quadrant. 

13 Sketch the curve x = cos*#, y = sin t, Ὁ - 1 < 2π. Find the area 
enclosed by the curve and the volume of the solid formed when the 
upper half of the area is rotated about the x-axis. 

14 Find the points of intersection of the curves r = a(1 + cos 6) and 
r = 3a(1 — cos 6) and show that the area enclosed between these 
curves is (17/2 — 94/3)a?. 

15 Find the centroid of the area bounded by the lines y = 0, x = at? 
and that part of the parabola y? = 4ax from the origin to the 
point (at?, 2at), where a and ¢ are both positive. Find the volumes 
generated when this area is rotated (i) about the x-axis, (ii) about 
the y-axis. Find the value of ¢ such that these two volumes are equal. 

16 The area bounded by the parabola y = x? and the line y = 4 is 
rotated about the y-axis to generate a solid. Find the moment of 
inertia of this solid about the y-axis, if the total mass of the solid 
is M. 

17 Find the centroid of the sector of a circle of radius a which subtends 
an angle 2α at the centre. Hence, or otherwise find the volume of 
the solid generated when the sector is rotated through one revolution 
about a bounding radius. 

18 Find the moment of inertia of a homogeneous solid sphere of mass 
M and radius a about an axis which is a tangent to the sphere. 

19 A right circular cone has a base radius a and altitude 2a and mass 
M. Find the moment of inertia of the volume of the cone about its 
axis. 

20 Find the coordinates of the centroid of the area in the first quadrant 
bounded by two concentric circles and the coordinate axes, if the 
circles have radii a and ὁ, b> a> 0, and the origin is their 
common centre. Find also the limiting value of these coordinates as 
a approaches b and discuss the meaning of this result. 

21 The area bounded by the curve y? = 4ax, the line x = a and the 
x-axis is rotated (i) about the x-axis, (ii) about the line x =a, 
(iii) about the y-axis. Find the volume generated in each case. 


8 
Matrices and Determinants 


8.0 INTRODUCTION 
The linear algebraic equation of the type 
Vi = 3x, + 7x2 + 6x3 
is of common occurrence in engineering. For example, the x’s could 


be the electrical currents flowing in the branches of a resistive network 
as in Fig. 8.1 and the y, the voltage drop across the network. 


Again, y, could be the weekly output of a factory which uses three 
different kinds of machine making the same product; the x’s are the 
quantities of each machine installed and the weekly outputs of the 
machines are 3, 7, and 6 respectively. 

It is usual to have to consider a set of two or more such linear alge- 
braic equations and it is straightforward to handle up to three such 
equations with integer coefficients without using other than simple 
algebra. However, if the coefficients are decimal or if there are many 
more than three equations it becomes necessary to employ a systematic 
algebraic process which will permit the important features of the set 
of equations to be examined and manipulated, and the information 
contained in the set to be stored for reference without necessarily having 
to solve the equations. 

The most important item of information contained within a set of 
equations is that provided by the coefficients of the algebraic terms. 


For example, the pair of equations 
V1 = 3x, + 7x2 + 6x3 
ye = 5Χ1 = 2X2 + 3x3 
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may be used to calculate the y’s from given values of the x’s. No matter 
how the x’s are changed the y’s are immediately determinable and are 
controlled by the coefficients of the right side, that is by the values of 
the numbers in the ordered array 


3 7 6 
5 - 35} 


Any rectangular array which stores definite information is called a 
matrix. A matrix has no numerical value, it is simply an assembly of 
terms which as a carrier of information may be manipulated according 
to certain algebraic rules. It is necessary in any such manipulation to 
know the exact nature of the information, for example the matrix 


6 2 5 
7 5 1 
12 0 1 


becomes meaningful only when the ‘key’ is given as in 


W L D 
Newcastle United 6 2 5 
Norwich City 7. 3 Ι 
Newtown Wanderers | 12 0 1 


8.1 DEFINITION AND NOTATION 


A matrix A of m rows and n columns is said to be of order m Χ n. 
The components of the matrix are called elements. A double suffix 
notation is used, the element of the ith row and the jth column of the 
matrix A being written a;, The array is usually enclosed in curved 
brackets (. ). Other notations in common use are square brackets [ ] 
and double bars || ||. When there is no ambiguity or doubt the matrix 
may be written A, (a,;), or simply ἀμ. Thus 


44, 2 Gg ... Arn 
Gai G22 Gog +--+ Gan 
G3, Q@z3g 433 +++ Gan 
A= (ay) = a; = . ᾿ : (1) 
ani ane ans πέος τς mn 


An important matrix is the square matrix of order n X ἢ, i.e. one 
of n rows and n columns. Such a matrix occurs for instance when 
dealing with a linear transformation from one set of space variables to 
another. In the two-dimensional Fig. 8.2 the point P has coordinates 
(a, b) when referred to the rectangular Cartesian system Oxy, and 
coordinates (A, B) when referred to the system OXY. 
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Fig. 8.2 


The transformation is given by 


A=acos6 + bsin@ 2 
= —asin θ + bcos 6 ὦ 
and the transformation matrix is the square matrix of order 2 
cos@ sin 8 
eee θ cos 5) : Ὁ 


Fig. 8.3 


The matrix (3) can also be interpreted as representing a rotation of the 
coordinate axes Oxy through an angle @ to the position of OXY. 
Alternatively if the coordinate frame Oxy is fixed as in Fig. 8.3 the 
matrix (3) can be interpreted as representing the rotation of the line 
OP, where P has coordinates (a, δ), through an angle 6 to take up the 
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position OQ, where Q has coordinates (A, B). Changing the sense of 
rotation would then mean that the matrix 


cos@ —sin “| 
sin9@ cos@ 


(4) 


can represent the rotation of the line OQ in the plane of Oxy through 
an angle θ in the anticlockwise sense. « 
A matrix with m = 1 


(α:. Gn Gig... 1.) 
is called a row matrix of order n, and a matrix ΠΗ = 1 
αχι 
Qo 
31 
Amt 
is called a column matrix of order m. 
A general linear transformation of m variables y,, i = 1, 2,3,. «Ὁ, γι 
and n variables x;, j = 1, 2, 3,. . .. m has the form 
Va = αιιχὶ $F AyaXq + αιοχα +. . - Ἔ inXn 
Ya = GaiX1 + AgaXq + AagX3 +. . . + GanXn 
. . ° . . (5) 
Vm = AniX1 + AnaX2 + An3x3 + see + AmnXn- 


In (5) the variables y, and x; may be represented by two column 
matrices 


Va xX 
ΒΖ X2 
y = x= ; 
Ym Xn! 


and the transformation matrix by A = (αμ). The equations may be 
written in compact notation in the form 


τι 
y= Σ ayx; i=1,2,3,...,m 
πεῖ 
or simply ᾿ Vi = αρχ, (6) 


where the repeated index j indicates a summation from 7 = 1 toj =n. 
It will be seen in section 8.2.6 that relation (6) is simply a matrix 
multiplication and that (5) can be written in matrix form 


y = Ax. 
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If i, 7 take only the values 1, 2, 3 then equation (6) can be considered 
as a relationship connecting together the pair of rectangular Cartesian 
coordinates (x1, ΧΩ, x3) and (j1, yo, ys). Suppose P is the point 
(x1, Χο, X3) and Q is the point (),, ye, ys) such that OP = OQ, then 
OQ can be obtained by rotating OP through the angle POQ in the 
plane of OP and OQ. The matrix a,, of equation (6) can then be 
thought of as representing this rotation and must carry information 
concerning both the magnitude of the angle and the location of the 
plane in which it lies. This representation of a matrix is investigated 
further in section 9.3.7. 


8.2 MATRIX ALGEBRA 


Consider now the basic algebraical manipulations of addition, sub- 
traction, and division as applied to matrix arrays. 


8.2.1 Equality of matrices 


Two matrices are equal if and only if they are of the same order and 
the corresponding elements are equal. 


8.2.2 Addition 
Consider the linear transformation 
Vi = 4X, + ia 


(7) 


V2 = ἄφιχι + ἄρχῃ 
Ya = Ag1Xy + Ag2Xq 


having the transformation matrix 
411 Aye 
A= {doi ὥρα (8) 
431 (58. 
and a second transformation 


Ζι = bx, + biaxg 
Ζ2 = bayXy + do2Xe (9) 
23 = bx, + byaX2 
having the transformation matrix 
11 δι 
B= {ba dae}. (10) 
bar Dae, 


If new variables w,, wz, and wz are constructed by adding the respective 
y and z variables, then equations 
Wy = γι + Ζι = Gi + δι})χ} + (Gig + διο)χα 
We = γὼ "Ἔ Ζ2 ΞΞ (@o, + bas)x1 + (G22 + beo)Xo (11) 
Ws = V3 + Z3 = (dg1 + bg1)X1 + (452 + ba2)X2 
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form a linear transformation having a matrix 
Qi t+ by, aie + Dre 

Qa, + ber Gag + doe]. (12) 
Q31 + by, age + Doe 


This process may be extended and leads to the rule of matrix addition. 

Two matrices A and B, each of order m x n, when added together 
form another matrix of order m x n whose elements are the sums of 
the associated elements of A and B, i.e. 


A+ B= (ay) + Gy) = Gy + δὼ) (13) 


Note 1 No meaning can be attached to the addition of matrices of 
different orders. 


Note 2 The result (13) may be extended to any finite number of matrices, 
1.6. 


ΑἜΒ- C= (ay) + Oy) + Cy) = Gi t+ by +ey) (4) 
and so on. 
Note 3 The commutative law and the associative law are true, i.e. 


A+B=B+A and (A+B)4+C=A+4(B+O). 


Example 
3. 6 0 2 3. 8 
7 2+ {-1 5|Ξ-[|6 7|: 
5 0 Bu «ἘΞ 8. - 


8.2.3 Multiplication by a scalar 
Suppose new variables 
Pi=Ayr, P2=Ayx and ps = Ays 


are defined, where 4 is a numerical magnitude usually called a scalar, 
and used with equations (7), then 


A= Aay1X1 + Adi2X2 
p2= λαφιχι + Ada2X2 
pP3= λάφιχι + λάβοχα 


(15) 


is a linear transformation with a matrix (λαμ). This suggests the 
following rule. 


Scalar multiplication 
If a matrix is multiplied by a scalar A then every element is multiplied 
by A, i.e. 


AA = A(ay) = (λα). 
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: Ze ae 1d 6 9 3 
Example (i) 3( 4 3 0) = ( 2 9 a) 
᾿ 3. -ὦ —3 2 
Example (ii) --1 (5 = ( ae 5) . 
Conversely, a matrix has a factor A if each element contains the factor A. 


8.2.4 Linear combination of matrices 


The rules of sections 8.2.2 and 8.2.3 lead to the combination rule in 
the form 


AA + μΒ + 9C = (Aay + μδι + vey) (16) 


where A, μ, and » are scalar quantities. In particular if ν = 0,2 = 1, 
and μ = —1, then 


A— B= (a, — δι). (17) 


8.2.5 Zero matrix 


The subtraction of two equal matrices of order m x n gives a zero 
matrix of order m xX n, that is one whose every element is zero. 


Example 

ge ἋΣ. ἃ 4 3 2 (8. 13. 8 

2(; 2 _3) +3(3 3 3) -('3 13 5 
(6 4 2 (12 9 ἢ Ι8΄ι 13. 8 
Ξὶ 4 -)1τὶὴλς 9 -ο 8. 13 -12 


_ (18 13 8 18 13 8 
"δι: 13 -12ὦ \8 13 —12 


8.2.6 Multiplication of matrices 
Consider the successive linear transformations 
Vi = Ay1Xy + AyoXe and X= δ1.Ζ) + by2Z2 
Yo = AgsX1 + Ag2Xe2 χα = baz, + baeZa 
Ya = AgiX1 + AgqX2 
with coefficient matrices 
αιι Ag b b 
A= μι “| and B= ie "ἢ 
31 as 21 22 


respectively. 
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They are equivalent to a single linear transformation 


Vi = (i101. + Gi2be1)21 + (@11b12 + Gy 2b22)Z2 
y2>= (@o1b11 + Goqbe1)Z1 + (Aoibi2 + G2ab22)22 
3s = (5ιθιι + @s2b01)21 + Gs1bi2 + Aszb22)Z2 


with a coefficient matrix 


Qy1b11 + Qjabe1 Airbi2 + Arebo2 
C = | daibi1 + Ggeb21  darbi2 + Ae2ba2}. 


@31b11 + A32b21 31b12 + sabre 


It is seen that the elements of C are obtained from those of A and B 
by summing the products of the elements of the rows of A with those 
of the columns of B. For example, the element of the third row and 
the first column of C is obtained from the third row of A and the first 
column of B. 


Matrix multiplication 


Given a matrix A of order m x n and a second matrix B of order 
n X p, then the product AB is a third matrix C of order m x p, such 
that the element of the ith row and the kth column of C is the sum of 
the products of the corresponding elements of the ith row of A and 
the kth column of ΜΒ. In a more concise notation, if A = (a,,) and 
B = (δ) where i = 1,2,3,. . .,m;j = 1,2,3,. . .,.n;andk = I, 2, 3, 
. » p, then C = (μὴ where 
Cik = i jD jx (summed over 7) (18) 
= Andy, + Asada, + Qisdax +--+ inDnee 

Note that for the product AB to exist it is essential that there are the 
same number of columns in A as there are rows in B. 


Example (i) 


23 (4 ; = (aoa ΤΠ" ἢ 
-1 2. 4)\, | ~ A\-54+84+12 π3 0.4) \I5 1 
2x3 @ xX 2) (2 x 2) 


Example (ii) 


( } 2.31 Ι0 -- 3 154+6 5412 7 21 17 

4 0} ( )- [τ 124+0 44 τ 12 ἢ 
6-1 9+2 34+ 4 51 7 

(3 x 2) (2 x 3) 3 x 3) 
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Example (iii) 


2 -1 
3 0 
0 —4 
(45 6 -—1 0 2) 5 3 
—2 1 
4 1 
( x 6) (6 X 2) 
=(8+154+0-—54+048 —-4+4+0-—24—3+40+42) 
= (26 —29) 
dl x 2) 
Example (iv) 


(; NIC 6 7) has no meaning. 


2x2) (x3) 


From the definition of matrix multiplication it is clear that if A is of 
order m x n then AB and BA can both exist if and only if Bis of order 
n x m. The product AB is then of order m x m and the product BA 
is of order n x n. These two products are of different order if m #4 ἢ, 
and it may be concluded that the product of A and B is non-commuta- 
tive, 1.6. AB + BA. This is illustrated in the first two examples above. 
Even when ἢ = m and the two products are of the same order they are 
generally unequal. 


Example (v) 
ae Paes oe “i τ | eae Fs ἢ 
PAYNE ο)7 13 ο) ἃ OP 27 ay 
A particular case of the equality AB = BA is discussed in section 8.2.9. 
It is possible to multiply AB by a further matrix C providing the 
matrix order is correct. Given three matrices A = (a,,), B = (b,,), and 


C = (¢x:) of orders m x n,n x p, and p Χ 4 respectively the product 
AB, of order m x p, is given by 


AB = (Aijb jx), 


summed over j = 1, 2,3,. . ., m. The product of AB and C, i.e. (AB)C, 
of order m x q, is given by 


(AB)C = (αμδικομὴ 
in which each element is a double summation taken over j = 1, 2, 3, 


..nand k = 1, 2,3,.. ., p. The order of the summation is imma- 
terial and the same result would be obtained by forming the product 
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A(BC). The product of A, B, and C is associative. It should be clear 
that the product is also distributive, i.e. 


(R + S)\C = RC+ SC 


where Καὶ and S are each of order r x p. 


Example (vi) If A and B are arbitrary square matrices of equal order 
then 


(A + B)? = (4 + BA + B) = AA + AB+ BA + BB. 
Example (vii) If A is any square matrix then 
AAA = 443 = A?A = A® 


where A? means AA. 


Example (viii) Verify the associative law of multiplication of matrices 
by forming the product ABC in two different ways, where 


0 
2 3 
4 1 0 3 1 
sai ἡ, “εὐ 2 eee 
1 2 1 


Solution 


14 -1 --ό 18 
ABC = (ΑΒ) =| 4 1 0 3 


8 -1 πά 1 

2 3 4 5 
and ABC = A(BC)={1 0 ( i} = {4}. 

Ι 2. 2 
Example (ix) The general linear transformation equations (5) of section 
8.1 may be written 


me Oe © 
Hl 
eae 
Noh MN 
ἘΞ 


Ji αι ἄς + ee Ain\ [χὰ 

y2 42, Gon» + ~ Gan X2 

Y mi απ Gm2 +++ Amn Xn 
or simply y = Ax. 


Example ‘x) Solve for x and y given 


(2 -s)(;) = ὦ) 
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Solution Multiply the left side 
21x + >) me ia) 
2x — 55») ~~ \19 


and hence, by section 8.2.1, it follows that 


21x + 6y = 24 
2x — Sy = 19. 
The pair of equations has the solution x = 2, y = —3. 


Example (xi) Find the points gained by each team shown in the foot- 
ball league table in the section 8.0, on the basis of 3 points for a win 
(W), 1 point for a draw (D), and 0 points for a lose (1). 


Solution The points awarded are as follows: 


Newcastle United 6 2 = 5\ /3 
Norwich City =[7 5 1)/0 
Newtown Wanderers 12 0 IT/\1 


1i8+0+5 
={21+0+1 


36+0+41 


8.2.7 The unit matrix 

Suppose an arbitrary rectangular matrix A = (a,;) of order m x n is 
unchanged after multiplication by a square matrix B = (6,,) of order 
n x n,so that AB = A. The elements b;, must then satisfy the equation 


Die = Aisdjx, 
(summed over j= 1, 2, 3,..., m) with i= 1, 2, 3, ..., m and 
k = 1, 2, 3,...,”. This means that the b,, must satisfy a set of 
equations 
ἄτι = 101. + Qyoba +. .. + andar 
αι = αι1θ12 + Qiabong +. .. + Ginbae 
413 = α11θ15 + Aygbog +. -. + Gindas 
etc. 
Since the 4,;, 412. . . are arbitrary, the set of equations leads to 
by = bog = bag = « εὐ τε re | 


with all other by, = 0. 
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The matrix B has a similarity to unity in scalar algebra and is called 
the unit matrix of order n. It is denoted by J,. For example, 


1 00 
I,={0 1 0 and Ι, τ 


00 1 


ooo _ 
ocoo- 
5- ὁ 
λεζο 


Given the matrix A of order m x n it is easily shown that not only is 
Al, = A, but also I,,A = A, and for this reason the unit matrix is 
sometimes called the identity matrix. 

When the order is either understood or immaterial the suffix may be 
dropped and the unit matrix written 1. 


3 7 2 
Example (i) Evaluate [A and AJ when A = ( ) 


oo 
Solution 
1 Ὁ 7 2 3. 1 2 
w= {Ὁ ἽΝ -4 Al =(; _4 3) 
1 0 0 
wd are (7 ἢ ) -ἰ a ΩΝ 
0 0 1 


Example (ii) If 


211 
A= ( 2 ἢ show that A? -- 5A + 41- 0. 
1 1 2 


Solution 


and A? — 54+ 41- 0. 
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A matrix of the form AJ where 1 is a scalar is called a scalar matrix. 
The latter is itself a particular case of a matrix with non-zero diagonal 
terms and all other terms zero. A matrix such as 


a4, 0 O ... 0 
0 a22 0 0 
0 O 453 0 
0 0 0 Ann 


is called a diagonal matrix. 


8.2.8 The transposed matrix 


Given an m Χ n matrix A = (a,,), the transpose of A is written A’ or 
A’ and is that matrix of order n x m obtained from A by interchanging 
the rows and columns. For example, 


6 
624 1 ee 
re a and Av= A = 4 

1 


8.2.9 The inverse matrix 


The next algebraic operation is that corresponding to division. Multi- 
plication and division are inverse processes and it is to be expected that 
‘matrix division’ will be dependent on the multiplication process. In 
scalar algebra division by 3, for example, can be considered as the 
multiplication by the reciprocal of 3. The latter is that number which 
when multiplied by 3 gives unity. In matrix algebra, given a matrix A 
the reciprocal or inverse matrix of A is defined as the matrix B such that 
AB = BA = I. Such a statement requires, of necessity, that both A and 
B are square matrices. When A is square and B exists it will be denoted 
by Aq}, 1.6. 

AA“+=A14 =] (19) 


Note 1 Itis possible that A~* does not exist. This possibility is discussed 
in section 8.5. 


Note 2 The condition that the matrices A and B commute in a general 


manner, 1.6. that AB = BA * J, cannot be dealt with here. It may 
be shown that such matrices can be found by considering the products 


of 
3 -1 ll —3 
A= ( i} and B= ( 6 5) 
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Example (i) Verify that 


1 1 - 3 1 2 
—2 --Ἱ 4] is the inverse of [2 1 O}. 
0 --Ι 1 2 1 1 
Solution 


3 1 2 1 1 = 
(: 1 ἢ (-2 —1 ἢ = 
2 1 1 0 -1 1 
1 1 —2\ /3 1 0 
and (-2 --ἰ Ε 1 ; = I, 
0 —-i 1/ \2 1 1 


The fuller understanding and the evaluation of an inverse matrix is 
dependent on the use of determinants to be defined and discussed in the 
next section. However, one application of the inverse matrix is illus- 
trated in the following example. 


a 
oor 
or Oo OK 
-»ν Oo © 
“eee 

Ι 

Lael 


Example (ii) Solve the system of equations 
3X1 +Xa+2x3= 5 
2x1 + Xe = —3 
2X, + X2+ X= 3. 


Solution In matrix form, the system of equations is written 4x = Β 
where 


3 1 2 xy 5 
A=j]2 1 O}, x=I[xel, and B={-3}. 
211 ΧᾺ 3 


Pre-multiply by the inverse matrix A~+, then 


A-14Ax = 4:18 
or Ix = 4 18 
or x = 4:18. 


Using the result of the previous example 


cee ewe 


and hence x, = —4, x. = 5, and x, = 6. 
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8.3 DETERMINANTS 
The solution of the pair of equations 


4y1X1 + AyoxX2 +b, = ο 
20 
ἄχιχι + ἀλοχλ + bz = 0 ) 
τς Χι = (6,284 — Ae2b1)/(G11420 — pa (21) 
Χα Ξε —(@,1b2 — d21b1)/(@11422 — 12401) 


provided that the denominator is non-zero. The three expressions in 
brackets arise from the cross products of the pairs of elements 


ay (12. ayy δι, and Qa\2 by. 


a 


21 ΡΥ ae be G20 bg 


Such expressions are called determinants of the second order. The 
augmented coefficient matrix of (20) is the 2 x 3 matrix 


41 Aye "ἢ 
22 
ee 22 be 2 


and each determinant of (21) is an evaluation associated with the three 
2 Χ 2 matrices available in (22). For example (a,9b2 — @zb,) is the 
evaluation of the determinant associated with the matrix 


Bs : 
G22 be 


αι} by 
Gon bg 


This determinant is written 


The solution (21) may be written in determinant form 


a en Ea (23) 
Ay. δ. ay, ὅδ. QQ, Aig 
22 bg dq, by δι aga 


. (3 -ἸἼ - : : 
Example The matrix ι: Ι 0) contains the three determinants 
of the second order 


3-7 
2 1 


3 --ὶ 
2 0 


3 


1 0 


--7 --ἰ 
ΝΕ | 


whose values are 17, 2, and 1 respectively. 
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Likewise, if three equations in three unknowns 


411X1 + Ay2X2 + Ay3X3 + δι = 0 
ἄφιχι + ἀ22Χ4 + AogXg + bg = 0 (24) 
4g1X1 + GgoX2 + Ag3X3 + ὃς = 0 


have a solution, each x; is found as the ratio of two six-termed expres- 
sions. The denominator is common to each x, and has the value 


431429033 — A11423432 + 19423431 — Ay 2421433 + 13021030 
— αι 80.963. (25) 


The reader should verify this expression by solving equations (24) for 
x, by eliminating x2 and x3. 

An expression of the type (25) associated, in this case, with a 3 x 3 
matrix whose elements are the coefficients of the x; of (24) is called a 
determinant of the third order and is written 


411 Gq 613 
δι (22 42g ]- 
81 6432 (33 


The other three third order determinants in the solution of (24) are 
associated with the further three 3 x 3 matrices obtained from the 
augmented coefficient matrix of (24). Each determinant contains six 
terms, three prefixed with a + sign and three with a — sign. 

Again, if four equations in four unknowns x, have a solution, each 
x, is found as the ratio of two 24-termed expressions, The denominator 
term is common to each x, and has the value 


411422433044 — 1192034443 + G11A23Mg4dag +... (26) 


to 24 terms. The expression (26) is associated with the 4 x 4 coefficient 
matrix (a,,) and is the evaluation of the fourth order determinant 
written 


Q2, (22 (ὅ243 Ao (27) 


441 42 (48. Aggy 


The 24 terms in the evaluation of (27) (in which 12 are prefixed with 
the — sign) are each of the type 41,423,425 Where «, 8, y, 6 are the 
numbers representing the columns from which the element is taken and 
αβγὸ is some arrangement, or permutation, of the natural order 1234. 
The first term with the + sign is a;1@22433444 and is called the leading 
term. The number 24 is simply the number of ways in which the natural 
order 1234 can be arranged. Six of these arrangements are 1234, 1243, 
1324, 1342, 1432, 1423. The reader should satisfy himself that there are 
4!, i.e. 24 terms in all. 


162 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


8.3.1 The determinant of the ath order 


Ann X n matrix A or (a,,) has associated with it a determinant of the 
nth order, i.e. m rows and n columns, written 


aii ayo . . . Gin 
Ὧι Gog . .. dan 

[4] ΞΞ [Ιαμ[Ξ} ἐν τ]: (28) 
Ani Ang . .. Ann 


The determinant (28), unlike the matrix A which remains a manipula- 
tive array of elements, has a definite value and consists of the sum of n! 
terms each of which is of the form +4,,@0,43,. . «Gna Where aBy. . .A 
is some arrangement of the natural order 123 . . . m and the sign of 
the term depends on the arrangement. 

The formal mathematical approach is to define the value of (28) by 
this sum and to derive the properties of determinants using the term by 
term evaluation. This approach will not be dealt with here but instead 
the evaluation and properties will be deduced from (25) for third order 
determinants, verified as necessary for fourth order determinants and 
then assumed to be true for the general determinant of order n. 


8.3.2 Minors and cofactors 
Consider the third order determinant 


ἄλι G12 ys 


tl 


|A| 


Ὧι 422 423 
81 432 433 


= Qj 1A90033 — (11,565. + @12@23a1 
— Gy 2091433 + A13421432 — 413422431. (29) 


Each element a,; of the determinantal array (29) has associated with it a 
determinant of the second order obtained by striking out the row and 
column containing the element a,,. This determinant is called the minor 
of a, and written M;,;. For example, the minor of ay. is the second 
order determinant 


431 (33 
having the value (a2:¢33 — @3,@23). A re-arrangement of the right side 
of (29) shows that 
[4] = @11(dz2d33 — 432423) — A12(do.d33 — 31423) 
+ @y3(421432 — A319) 
= 411M, — 412My2 + α15Μ,5. (30) 
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The re-arrangement of (29) can be performed in five other ways using 
the elements of the two other rows and three columns. For example, 
two further ways are given by 


|A| = —412M yo + Go2Mo2 — 45.}3. 
= 49:51 — dggMz2 + (5. Μ 95. (31) 


It is more convenient to introduce the cofactor of the element a,;. This 
is usually written A,, and is the signed minor of a,, so that Ai, = +My. 
The sign for the leading element is + and the sign of any other cofactor 
is taken from the array of signs 


ang 
= Seale (32) 
ae 3 SA 
For example, 
411 Qa 


Ags = —Mos = and Asi = + Mg). 


Q31 Age 

The determinant (29) may be written 
441A, + 12412 + 413Ay3, evaluating along the first row, 

OF 1A, + do14a1 + 431491, evaluating along the first column, 

OF Ω; 5441. + Ae2A0q + A32Ao0, evaluating along the second column, 


or in three other ways. The six possible ways of evaluating the third 
order determinants are 


[4| = ἀμάμι + Gi2dio + αι544.5, evaluating along the ith row (33) 
= ;A1; + ἀ5,42;) + 45,44), evaluating along the jth column)’ 
Extending this process to the fourth order determinant and using the 

elements of the first row, say, (27) can be written 
@y3Aq1 + αι... + Qy3413 + A14Ai4 (34) 


in which each of the cofactors Aj, is a third order determinant that 
can be evaluated using (33). Note that (34) becomes 


411 | gg 433 agg | +... to 4 terms, 


Q33 (584 82 Asa 82 (85 
OF Q11 )\ 422 — a23 + az4 
Ὧ44 44 G42 444 42 (45 
+... ἴο 12 terms, 
OF αι, 58,44 — (1, 534445 + A11A2gQasQag +... to 24 terms. 


When written out in full the last expression may be shown to be 
identical with the expression (26). 
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The evaluation of the nth order determinant is now defined by the 
extension of (33). Thus, 
|A| = ἄμάμ + ἀμάμ t+... + anAin (ith row) | (35) 
= αι,4.; + Q2;A 9, +... + QnjAng (jth column) 
in which A,, is the cofactor of a,, and is that determinant of (γι — 1)th 


order obtained by striking out the elements of the ith row and the jth 
column of (28) prefixed with the sign taken from the array 


oe GS ake Cee 
-_ + ae ee 
+ -— + 

(-p - + 


Repeated application of (35) will reduce any nth order determinant to 
n!/6 determinants of the third order or n!/2 determinants of the second 
order prior to evaluation. In practice determinants of large order may 
be simplified by using the properties of the next section, evaluated by 
numerical methods, or may be avoided altogether. 


2 1 1 1 


: 0 21 90 
Example (i) Evaluate 3 0 1 2\ 
1-1 2 #1 
Solution Using the elements of the second row, the evaluation is 
ἡ 
-» [2 1 1͵|ε- |2 1 1 
—0+2/3 1 2|-1||3 O 2/+0. 
1 2 4] 1 —1 l 
t 
Further evaluation along the row and column indicated leads to 
1.2 3 2 3 1 
2 21 —1; [+ 1 Al 


παι τ τόπους al 
Ξε 2(2(--3) — 1 + 5) -- {-1(+1) ἘῸ Ἐ1) 
= 2{--2} — 0 
= --4. 
A useful method of evaluation for a third order determinant only is 


given by the Rule of Sarrus. Repeat the first two columns of the deter- 
minant as shown below and then form the two groups of triple products 
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one prefixed with the + sign and one with the — sign. It will be seen 
that the two groups consist only of the terms of (25) 


2 
Example (iit) Evaluate | 1 -I 7. 
—5 0 


Solution Write 


2 
1 - 7 
0 


and the evaluation is 
+ {4)(—DO) + )0)--5) + 2A)8)} 
— {(—5(-— DQ) + 8)ME4) + OME} 
{0 — 105 + 16} — {10 + 224 + 0} 
—89 — 234 
— 323. 


8.3.3 Some properties of determinants 

The results to be stated in this section are general results and may be 
applied to determinants of any order but illustrations are usually given 
by referring to the third order determinant 


41 M2 As 
|A| = 


G2, (22 (93 


: 43, 32 (53 
with cofactors A,,. 


Property 1 
If every element of a row of a determinant is zero then the value of the 
determinant is zero. 


This follows immediately from (35) by evaluating along the row of 
zero elements. : 


Property 2 

The determinant of any square matrix is equal to the determinant of 
the transposed matrix, i.e. to any property concerning the rows of a 
determinant there is an equivalent property concerning the columns. 
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This result is clearly true for a second order determinant because 


me ae = ae = 411422 — 41412. 
G21 (22 Q12 (22 
Consider 
αι Ag, 1 
|A’| = το A422 (532 


ἄι8 423 agg 
and evaluate using the elements of the first column, i.e. 


|A'| =a 422 (582 δι (581 ἄδι (581 
ΞΞ 11 — Ae. 
423 33 423 (35 ἄς Age 
= 43Ai1 + Q2412 + G13413 
= |4]. 
Property 3 


If each element of a row (column) of a determinant is multiplied by a 
scalar k then the determinant is multiplied by k. 

This follows immediately by evaluating the determinant along the 
row (column) containing the factor k. 


Example (i) 
43, Aq aig αι kay, a 
Kao, kao, Kas = k{A| =] G21 kage 4.3 
431 (4). (88 G3, ΚαβΩ Aaa 
Property 4 


If any two rows (columns) of a determinant are interchanged, then the 
sign of the determinant is changed. 
Consider 

431 432 (38 

[ΔΙ ΞΞῚ ὅφδι Gq oz 

ἄτι Ay2 aig 
obtained from |A| by interchanging the first and third rows, and 
evaluate along the third row. 


[Δ] τὰ Q32 (538 oS Q31 (88 Τῷ 431 (32 
Ἶ 22 (25 on G21 Gea = G21 Aa2 
= @1;(—Aj3) -- 42(A12) + @3(— Ais) 
= —|Al. 


Corollary to Property 4 

If any two rows (columns) of a determinant are identical then the 
determinant is zero because interchange of the rows (columns) would 
lead to a change of sign by this property and yet must at the same time 
leave the determinant value unchanged. 
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Property 5 


The sum of the products of the elements of any row (column) with the 
cofactors of any other row (column) is zero. 


Consider [8] = a,; 431 + G12 42. + G13 4425 in which the elements of 
the first row of |A| are multiplied by the cofactors of the second row. 


a2 Arg 41 13 431, (12 


Then |B| = 


+ ay. 


Ι 
| 
iS 
5 
"Η 


— a3 


431 33 81 Ase 


41 M2 (18 
=~} 411 G12 Ag 
43, 82) 3g 


=0 
by the corollary to Property 4. 

This property may be generalized and, together with the cofactor 
evaluation (35) for the ith row, may be expressed in the single statement 
n O if if<k 
> Buy Any = i if i=k. 

Employing the summation convention for a repeated index this may be 
written 
ay Ax — 51x A] (36) 
where δὲς is called the Kronecker delta and has the property that 
a 0 if if#k 
soe Ue: ἱπεκ. 
Using the elements of the jth column in (35) a result similar to (36) 
may be stated 


(37) 


ἄμ An = 544| A] (38) 


Property 6 


A determinant is unchanged in value if to the elements of a row 
(column) are added a scalar multiple of the elements of any other row 
(column). 


Consider 


Qy, + kag, α1. + kdgg Gig + kdog 
[8] = 21 22 Gog ’ 


431 32 a33 
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[8] = (αι, + kaey)Air + (Gig + kg2)Ai2 + (Aig + kaos) Ais 
= (αιχά1: + Gy2A12 + 13413) + k(Qe1A11 + (2541. + G23A 13) 
= [4] 
the last line following from (36). 


This property may be applied repeatedly to a determinant in order 
to reduce it to a simpler form before or during evaluation. 


Example (iii) Evaluate 


1 5 —3|R 
2 7 0] Ro. 
3 9 34 Rs 
Solution Perform the operations indicated 
1 3 —3]R 
1 2 +3] 14 -- Κὶ -Ξ by the corollary to Property 4. 
1 2 3|Rs3— Re 
Example (iv) Evaluate 
1 1 1 
a ὃ ς 
aD ε2 
C, Cg Ὁ, 
Solution Perform the operations indicated 
1 0 0 (1 0 0 |e 
a b-—a c—a |=(b-alc—a)l\a 1 1 
a 053 --αὐ c?—@ αΣ b+a c+a 


Cc; Cy hay Cc; Cs — Cy 
= (ὁ — a)(c — alc — δ). 


8.4 THE ADJUGATE OR ADJOINT MATRIX 

Before the inverse matrix introduced in section 8.2.9 can be finally 
defined it is necessary to examine a square matrix called the adjugate 
or adjoint matrix of A = a,; written adj A and defined by 


Ay 421 ΚΕΝ ΒΡ ΒΟ An 
Αι) Ang 452... Ana 


Ai3 Agog Ags ..- Ang 
ss - p= 4y) (39) 


in Aan Asn as? δὲ Ann 
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where Aj; is the cofactor of the element a;,, of the ith row and jth 
column of the determinant |A|. Note that adj A may be obtained from 
A’, the transpose of A, by replacing the elements of A’ by their cofactors 
in |A’|. 


Example (i) If 


42 1 4 3 0 
A={3 0 1 then A’={2 0 1 
oO 1 2 1 1 2 


—1 —3 2 
and hence adj A=[-6 8 —I}. 
3 -4 - 
Let B be the matrix product of A and adj A then 
α1χ Gyo λα... Ara\ [Ary Aoi Agi... And 
Gq, 29 Ag.» « Aan} [412 Age 43... .. Ana 
Ani Ang Ang» + + Ann in Aon Asn Sis es. Aan 


and from the definition of matrix multiplication 
by, = @yAyz3 Summed over ἡ 
= δι. [4] 


in the notation of (36), section 8.3. The property of 0, shows that B is 
a scalar matrix and 


|A| 0 O 0 
0 |jAl O 0 
0 0 |A| 0 
A adj A = ‘ 
0.ϑὃ909 0... JA] 
= |All. (41) 


Using (38) of section 8.3, an exactly similar result follows for (adj 4)A 
so that 


A (adj A) = |A|I = (adj A)A. (42) 
Example (ii) If 


Be Js oe 
4= (i Ἵ «ὦ 4-ς ἰ 2) 
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and then 


Me Nae ae 


(adj A)A = Ge =) ( ᾿ -- ls 2) = 5]. 


Example (iti) If 


2 3 1 1 0 1 
A= ( 2 | and B= ( 2 ἢ 
012 1141 
show that adj (AB) = (adj B)(adj A). 
Solution 
3 7 6 0 3 --3 
AB= [ 4 Ὶ and adj AB= | 3 -3 =; 


2 4 3 4 2 5 


ie Pee) er 4 -- --2 0 3-3 
(adj Badj Ady={ 1 0 --Ι|[--Ὃὸ 4 1)=[ 3 --3 --3]. 
ΞΕ eee) a eee -4. 2 5 


8.5 THE RECIPROCAL OR INVERSE MATRIX 


In section 8.2.9 the inverse matrix of a square matrix A was written in 
symbolic form as A~* such that 


AA*=A 14 =1. (43) 
Comparison with (42) shows that (adj A)/|A| is identical with 477, ie. 


Anta adj A (44) 


~ {Al 
provided that |A| - 0. 

If |A| = 0, the matrix is called singular and the inverse A~* does not 
exist. 


3 1 2 
Example (i) Given A=|2 1 | find A7?. 
21 1 
Solution Here |A| = 2(0) + 1(1) = 1 so that A is non-singular and 
ἢ 1 1 —2 
A-}= —adjA=[-2 πὶ 4). 


[4] 0 -- 1 


211 
Example (ii) Given A=|0 11 | find A7*. 
6 4 3 
Solution Here [4] = 0, A is singular, and A~* does not exist. 
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8.6 SOLUTION OF REGULAR LINEAR ALGEBRAIC 
EQUATIONS 


Consider the system of » linear equations in » unknowns x, 
Ax = B (45) 
where A is the » Χ n matrix (a,,), x is the column matrix (x,) and B 


is the column matrix (6,). If A is non-singular the system (45) is called 
regular. With A non-singular, 4 1 exists and equation (45) may be 


written 
A-1Ax = 4:18 
ie. Ix = 4:18 
or x= 418 (46) 


The equation (46) is then the formal solution of (45). 


Example (i) To solve 2x, +x, —x,3 = 4 


I 
| 
Θ 


Χὶ — 2x2 + x 


— 3x, — 2x3 = 9 


2 1 -- ὟΣ 4 
= 1 —2 1], x= x2} and B= {-—10}. 


Since [4] = (—3)(—1) + (—2)(—5) = 13, then 


ἢ 4 2 -1 
4 a -“ἸΙ, -7 - 


write 


—6 -3 —5 
1 4 2 —-!1 4 1 —13 
and x= Τῷ —-1 —7 —3}{-—10 = 55 39 
—-6 —3 —5 9 —39 
Hence, x; = —1, x, = 3 and x, = —3. 


The solution (46) may be expressed in terms of determinants and 
cofactors. As an illustration consider the case n = 3, then 


1 1 Ay, Ao Asi\ (δι 

x = τ (adj A)B = [A] Aya 422 Asa]{ be 
Ay3 Ags 454 
1 (a + bzAar + oa 


3 


διΆ.. + θ2492 + θᾳΆ5. 
bi Ay3 + δ,Ά24 + ἘΞ θ54935 


|A| 
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where A,, is the cofactor of the element a,, in A. Hence, 


δι΄ Xa ἃς _ 
A, A; Gs {Al 
A 
or = Τῇ (47) 


where Δ, = b,A,, (summed over i). 

In equation (47), |A| is the determinant of the matrix A = (a,,) and 
A, is the determinant of that matrix obtained from A on replacing the 
jth column of A by the column (4,). The solution of equation (45) 
expressed in the form (47) is called Cramer’s rule. The result (47) may 
be extended immediately for the case n > 3. 


Example (ii) Solving example (i) by this rule leads to 


xy Xe Xs = 1 
Ta Oo eo 4.»Μ 712 1- 
--ἰὸ --2 1 1-10 1 1 --2 10 1- τ .1 
9 0-2 Jas 9-2 |-32 0 9] [|- 0 --2 


It is left to the reader to evaluate each determinant and verify the 
solution. 


8.6.1 Solution by the elimination method 

A general method of dealing with the system of equations (45) is to 
eliminate unknowns x, until one equation contains only one unknown, 
solve for this unknown, and then work back through successive equa- 
tions to find the other unknowns. A systematic method of procedure 
was suggested by Gauss. This method is sometimes laborious but being 
systematic, and of a routine nature, it is suitable for a computer 
program. 

Consider as an example the system of equations 


2x4 — Xe + 3x5 = 10 
x, + 3x, — 4x3 = —11 (48) 
4x, - χα — 2x3 = 3 
2 -- 3\ χὶ 10 
or 1 3 - xg} ={—11 (49) 
4 1 —2/ \xz, 3 
A x = B 
The matrices of (49) may be written in augmented matrix form (A, B) 
2 -1 3 10 
1 3 —4} —11 (50) 
4 1 —2 3 
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and because of the equality signs in equations (48) certain row manipu- 
lations may be performed on (50) without affecting the final solution. 
The three important row transformations are: 


I Interchange of rows, 
II Multiplication of a row by any non-zero constant, 
III Addition of rows. 


Apply these row transformations to (50) to make the matrix A into an 
upper triangular matrix, i.e. one with zero elements below the leading 
diagonal. The successive steps are indicated below. 


1 3 —4[—11\ R, 
Apply I, (: —1 3 ἢ Κα 
4 1 - 3) Κα 

1 3 --ξἱ] —11\ R, 
and then apply II and ΠῚ ( —7 1 =| R; 
0-11 14 47] Rj 
where Ry = Re — 2R,; Ry = Κα — 4R, 
1 3. —4|—11\ R, 
Apply II and HI again ( --7 11 »| Ry 
0 0 —23 | —23/ Κς 

where Ry, = ΤΕ: — 11R, 


and a final application of II leads to 


1 3 -—4] -- 
0 7 —11 {| —32]. 
0 0 1 1 


The solution may be written down working from the last row, i.e. 
X3 = 1, 7x2 — 1lx3 = ~—32, and x, + 3xq — 4x3 = —11 or x3 = 1, 
X, = —3 and x, = 2. In a practical problem the coefficients of a 
system of algebraic equations are usually decimals and it is convenient 
to use II to make the coefficients of the variable being eliminated equal 
to unity. The numerical details of this, together with other methods of 
solution, is given in Chapter 13. 


8.6.2 Alternative method of determination of the inverse matrix 4 1 
If A is a non-singular matrix then the equation 


Ax = IB (51) 
has the formal solution 
Ix = 4.18 (52) 


which suggests that if A is transformed into J by the row transforms 
I, If, and ΠῚ, then J will be transformed into A~* by the same processes. 
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The method is illustrated in the following example where ~ is a 
symbol used to indicate that successive matrices are obtained by row 
transforms. 


Example Find the inverse of 


Solution Write 


12 2/1 0 0\R 
wpn(las orga, 
13 3/00 11. 

ι 2. 241 Oar 

~(° tea io) 

0 0 =1| O -ἰ 1) Rk 


where R,= R; — Ro RZ=R2— Κι; RL=R 


1 0 Of 3 0 —2\R% 
~( ι 20 -ἰ ἜΣ 
0.010 1 Sy Re 
where RY = Ri -- 28: -- 21’; RZ τὸ RB, + 2K R= —R, 


3 0 - 
Hence, At=|[-—1 τὶ 2}. 
0 1 —1 


These steps are not the only ones by which (A, J) can be transformed 
into (1, A~*) and the reader is invited to attempt the transformation 
using different steps and also to verify that the result obtained is the 
same as adj A/| A]. 


8.7 FURTHER DISCUSSION OF THE SOLUTION OF a 
LINEAR EQUATIONS IN n UNKNOWNS 


The discussion of the system of n equations in m unknowns 
Ax =B (53) 


is given here without too much detail. The reader who requires a more 
formal treatment is advised to read this section and then to consult a 
text devoted exclusively to matrix theory. 


8.7.1 Regular solution, A non-singular. 


If the n x n matrix is non-singular, then |A| # Ὁ and it is evident that 
Cramer’s rule, 
x,=A/|Al, j=Hl,2,3,.. ἢ 


leads to n distinct values of x,, i.e. to a unique solution. In this case 
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the ἡ linear equations are said to be linearly independent. Some of the 
A, may be zero to give zeros in the corresponding x,, but the equations 
(53) cannot have a solution with all x, zero and A non-singular unless 
each A, is zero, and this implies that B is zero. The equation (53) then 
reduces to 

Ax =0 (54) 


This equation is called a homogeneous equation (the equation Ax = B 
being called non-homogeneous) and has a unique but zero solution if 
and only if A is non-singular. This solution of equation (54) is called 
the trivial solution. 


8.7.2 Singular solution, 4 singular 


Suppose the matrix A is singular. The |A| = 0 and the form given by 
Cramer shows that if the A, + 0 the x, cannot be found, i.e. no solution 
exists and the system of equations is said to be inconsistent. 


Example (i) The system 


Xi + X2=3 
2x1 + 2X9 ΞΞ 7 
is obviously inconsistent. Here [4] = 0, Δ; = —1, and A, = 1. 


Example (ii) The system of equations 

3x, + 2x, + x3 = 10 

2x, + 3x2 -- x3 = 5 

4x, + x2 +3x,= 8 
has no obvious inconsistency but |A| = 0, A, = 35, and Az, Ag are 
also non-zero. 


A closer inspection of the two examples shows that in the left side 
of each system one of the expressions is dependent on the others, for 
example 

(4x1 + χὰ + 3x3) = 2(3x, + 2x2 + X3) — (2x, + 3x2 — Xa), 
but that this is not the case for the right side, i.e. 8 4 2(10) — (5). 

Suppose now that all the A, are zero, then Cramer’s solution 
x, = A,|A| is of the form 0/0 which is indeterminate. 

Example (iii) The system 
3x, + 2x2 + x3 = 10 
2x, + 3x2 - Xg= 5 
4x, + X2g + 3x3 = 15 


has [4] = 0 and A, = A, = A, = 0 and there is no unique solution. 
In this example the last equation is a linear combination of the other 
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two and the system reduces to one of two equations and three unknowns 
3x, + 2X2 + X3 = 10 
2x1 + 3x2 — X3 = 5. 


The original system of three equations is said to be linearly dependent. 
One of the unknowns can be disposed of arbitrarily, say x3 = «, and 
the others expressed in terms of «, i.e. 


χιτε4-- α, χετξεαὰα-- 1, x= ou 


Note that there is a solution to the system of equations although 
indeterminate to the extent that « is arbitrary. Any system that possesses 
a solution regular or indeterminate is called consistent. 

To sum up, a system of equations is either inconsistent (|A| = 0, 
A, #0) or consistent and only the latter possesses a solution. A con- 
sistent system may have a unique solution (i.e. one fixed set of values 
for x,) in which case the solution is called regular or it may have an 
indeterminate solution (i.e. an infinite number of different sets of 
values x,). An illustration in which one variable may be assigned 
arbitrary values is given by considering the single equation 2x, + 3x, 
= 6 with two unknowns. Geometrically this is the equation of a straight 
line in the plane Ox,x2 and the solution x, = 3 — 3a, x, = α gives 
the coordinates of an arbitrary point on the line. Similarly, the pair of 
equations above 


3x, + 2x2 + x3 = 10 
2x, + 3xg—x,3= 5 


may be geometrically interpreted as the intersection of two planes in 
three dimensions (see section 9.7). The solution 


X,=4-—a, x,=a-—1, χετξξ α 


gives the coordinates of the general point on the common line of inter- 
section of the planes. Different values of « simply give different points 
on this line, e.g. (4, —1, 0), and (10, —7, —6) are two such points. 

It is necessary to consider the general case. Without proof, it can be 
stated that the system is consistent if and only if r of the equations 
are linearly independent where r is any non-zero positive integer such 
that r <n and then (n —r) of the unknowns can be disposed of 
arbitrarily. In order to investigate a system of equations the elimination 
method of section 8.6.1 may be used. Any inconsistency in the system 
will appear in the equivalent form of the augmented matrix (A, B). 
The following examples should illustrate the process. 


Example (iv) Discuss the system of equations 
2X, + Xg + 5x3 =1 
Xy — 3x_g + 6x, = 2 
3x, + 5x2 + 4x3 = 0. 
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2 1 SV iy 1 
(= ἡ τ 
3. 5. 4: 0 
1 1-3 6/2\R 
ἜΣ Ι 5 ἢ 
0 3 5 4/0/R, 
3 
7 


1 -3 6 2 1 - 6 
~ 10 7 —7| —-3]~ {0 —7 
0 14-14] - 0 0 0 


2 
-3. 
0 
Κι, = 2Ri; Rg oe 3R, R’; = 2R’, 


The last row of zeros shows that the system is consistent but that one 
of the equations is linearly dependent on the others. One unknown may 
be arbitrarily assigned, let xg =a then 7x, = 7a—3 and 7x, = 
5 — 21α. Note that here n = 3 and r = 2. 


Solution Write 


21 5 
and (4, 8)--[1 —3 6 
3. 5. 4 


Example (v) Discuss the system 


2 1 S\ [x1 1 
1 — 3 6 X2 = 2 . 
3 5 4] λχᾳ 1 


Solution In this case 


ι -32: 64ᾳ. 2 
(4,B)~{0 7 --Ἰ} --3 
0 o of] 1 


and the last row shows that the system is inconsistent. Assuming that 
x, and Xg are non-zero the equation corresponding to the last row is 
Ox; = 1 which is impossible for finite x3. The system has no solution. 


Finally when B = 0 the system is homogeneous, Ax = 0 and the 
system is always consistent. If |A| #0 there is only one solution 
x, = 0 (see section 8.7.1) called the trivial solution but if |A| = 0 then 
ΟἹ — r) of the variables may be assigned arbitrary values where r is an 
integer such that 0 < r <n, and the remaining r variables obtained in 
terms of these assigned values. Thus the condition that the equation 
Ax = 0 has a solution other than the trivial one is that |A| = 0. 


Example (vi) Solve 
3x, + χο 4+2x3-— x, =0 
Χὶ το χὰ -τ Xe Ξε 0 
4x, + χρ-- x,=0 
9x1, — Xg + X3 — 2x, = 0. 
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Solution The matrix is transformed as follows 


3.1. 2 -| [a Sa, οὐ 
pce SE ON 1S. Ae) A 
4.0 1 -| 40 1-]! 
9. - 1 - 9 -1 1 - 

1-1 -1 0 Ι- -ὶ 0 

OF 4 5 - 0 4 5 =I 

TNO: 4 5 -|7|Ὸ ὁ ὁ 0 

0 8 10 -2 0 0 0 0 


and the last two rows show that the system has two equations that are 
linearly dependent. Two of the variables may be assigned arbitrary 
values, let x, = α and x, = f then 4x. = β — 5a and 4x, = β — α. 


8.8 THE SOLUTION OF A GENERAL SYSTEM OF m 


LINEAR EQUATIONS IN 2 UNKNOWNS 
When m = n the system reduces to that already discussed in section 8.7. 


Consider now the two cases m << ἡ ἀπά γι)» ἡ. 


8.8.1 Case m <n 
Given the system 
Ax = B (55) 


A is an m Χ n rectangular matrix, x is an n Χ 1 column matrix and 
B is an m X 1 column matrix and there are more unknowns x, than 
there are equations. The system (55) is consistent if and only if r of the 
equations are linearly independent where r is a non-zero positive integer 
such that r <m. In the solution, (” — r) of the unknowns may be 
assigned arbitrary values and the remaining r variables found in terms 
of these assigned variables. 


Example (i) Examine the system 
3x1 + X2— χαρτὶ xX, = 4 
Χι + 2x3  2χ, ΞξΞῷ 5 
3x2  χᾳ4-τ xX, =3 
in which m = 3 andn = 4. 


Solution Proceeding by the elimination method 


4 1 0 
it 
3 0 3 
1 0 2 2 
~( 1 - —5 
0 


0 


3. oe 1 
(4(.,Β-[1 0 2 2 
Ο °F. 1 -ῷῳι 
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The equivalent form of (4, B) shows that the system has no inconsis- 
tency, the equations are linearly independent, and r = m = 3. One 
(n — r = 1) of the variables may be arbitrarily assigned. Let x, = « 
then llx, = 18 — 7a, llxg = 5 + 6a and 11x, = 19 — δα. 


Example (ii) Examine the system 
Xy +X. πο χα -- X= 1 
2x1 — Xe +x%,=2 
3x1 — Xs = 


in which m = 3 and n = 4. 


Solution 
1 1-1 ~—1 {1 1 1 πὶ —1/1 
(A, B) = ( -1 0 1 Ἶ ~ ( -3 2 3 ἢ 
3 0 -ι 013 0 --3 2 310 


0 0 oO 0 

Rg τὸ Ro; 3Ry + Re 
which shows that the system is consistent but that one of the equations 
is linearly dependent on the other two, 1.6. r = 2. Then two (n — r = 2) 


of the unknowns may be assigned arbitrary values. Let x, = «, X3 = β, 
then 3x, = 26 + 3« and 3x, = 3 + β. 


Example (iii) Examine the system 
Xy— 2χ, το χα — X= —4 
3x, + Χο x3 —2x%,= Il 
Xy + 12xg + 6x3 + xX, = 42 


in which m = 3 andn = 4, 


Solution 
1 —-2 -- πὶ} - 1 -Ὦ2 —1 --} - 
(A, Β) = ( i 1 --2 i] ~ (° 7 4 1 =| 
1 12 6 1] 42 0 14 7 2; 46 
1 - —1 —1{— 
~( 7 4 1 | 
0 0 --ι 0 0 


and the system is consistent, r = m = 3 and one (ἢ -- r = 1) of the 
variables may be arbitrarily assigned. The last equation formed from 
the equivalent matrix shows that x3 = 0, let x, = a, then the solution 
is X4 Ξε αἱ χα = 0; 7χ. = 23 — « and 7x, = 18 + 5a. 
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Example (iv) Examine the system 
Xy— 2X%,—- X3—- x, = —4 
3X, + ΧΑ X3 —2x,= I 
Xy + 12xg + Txg + Xs 31 
in which m = 3 and n = 4. 


Solution 
1 -- -1 -1 
(A, B) = ( 1 1 —2 
1 12 7 1 


0 0 


= - 
4 1 
8 2 
ae τοῦ 
4 1 
0 0 


and the equivalent form of the matrix shows that the system is incon- 


sistent. 


8.8.2 Casem>n 


In this case the system (55) has more equations than unknowns. The 
system is either inconsistent in which case there is no solution, or it is 
consistent and the equations are linearly dependent. 

If the system is consistent then, as in section 8.8.1, r of the equations 
are linearly independent and (7 — r) of the unknowns may be assigned 
arbitrary values. The non-zero integer r is such that r <n. 


Example (i) Examine the system 


2x, + x, = 7 
X+ X= 5 
4x, + x2 =11 
5x1 τ X,= 7 
3x, —2x,= 0 
in which m = 5 andn = 2. 
Solution 
1 1| 5 1 Ι 
2 1] 7 0 --ἰ 
(4,B)~]4 1|1 [0 --3 
5 —-1!1 7 0 —6 
3 -Ξδ͵᾽ 0 0 —5 
1 1| 5 1 0 
0 --]-3 0 1 
~ 10 0; OF ~TO 0 
0 010 0 0 
0 0.90 0 0 


oo OWN 
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which shows that the system is consistent and that three equations are 
linearly dependent on the other two, ie. r= 2, n—r=0. Finally, 
Χῳ = 3 and x, = 2. 


Example (ii) Examine the system 


2x, + x, =5 
3x, + 2x, = 3 
Xy+ xX, =2 


in which m = 3 and n = 2. 


2 115 ie at 
(4, 8Β)-[3 2/3)~ [0 -ἰ 
fe 12 0 0 


The last row shows that the system is inconsistent. 


Solution 


2 
1 
Example (iii) Examine the system 
3x1 — Xe + X3 = 1 


Xy + 2x2— X3 
7x, = 1X2 + 5X3 τι --Ἰ 


12x, + 3X2 = 
71x, + x3= 4 
in which m = 5 and n = 3. 
Solution 
3 -1 1 J 1 2 -- 2 
1 2 —1 2 3 --ὶ 1 1 
A. Bye 17 ST Saat pee aT S| 1 
12 3 0 9 12 3 0 9 
7 0 1 4 7 0 1 4 
1 2 --| 2 1 2 --|Δ 2 
0 —7 4 —5 0 —7 4 -- 5 
~{TO-21 12] —I5]~ 0 0 0 0 
0 7 -4 5 0 0 0 0 
0 7 --4 5 0 0 0 0 


At this stage it may be seen that the system is consistent, that three of 
the equations are redundant, i.e. r = 2, and that one (n — r = 1) of 


the unknowns can be given an arbitrary value. Let x3 = a, then 
7χ. = 4a + 5and 7x, = 4— α. 


Finally, the condition that a system of (n + 1) equations with n 
unknowns is consistent may be expressed as a simple determinant. 
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Consider, for simplicity, the case of 3 equations with 2 unknowns, i.e. 
Q11X%1 + AyoX_ = δὶ 
Qg1X1 + AgoXq = be 
431X1 + AgoX2 = bg. 

Replace δι by tb; and the system may then be considered as a homo- 
geneous one of three equations with three unknowns, x,, x2, and ¢. 


From section 8.7.2 the homogeneous system has a non-trivial solution 
if and only if the determinant of the coefficients is zero, i.e. 


41 Qo by 
Qo, og be 
43, 32 bg 


= 0. (56) 


Since ἢ = 1, this must be the necessary condition for the original three 
equations in two unknowns to have a unique solution. The condition 
(56) can be immediately extended to the general case of n unknowns. 


8.8.3 Summary 


A system of linear equations Ax = B in n unknowns may be tested for 
consistency by carrying out a process of transformation of the matrix 
(A, B). The method involves an elimination of successive unknowns 
from the equations of the system by an application of row manipula- 
tions to the matrix (A, B). The process concludes when the matrix 
(A, B) has been replaced by the equivalent matrix (C, K) where K is a 
column matrix and C is an upper triangular matrix, i.e. one whose 
elements below the leading diagonal are all zero. Any inconsistency in 
the system will be shown by a row of (C, K) having all its elements zero 
except the element of the column K. If the system is consistent then the 
number of rows of (C, K) whose elements are not all zero is the number 
of equations of the system that are independent. The difference between 
the number of unknowns and the number of independent equations 
determines the number of unknowns that may be assigned arbitrary 
values. 


EXERCISE 8 

1A=(a,), B= (6;,), and C = (c,:) are matrices ΟἹ order 7 x 3, 
3 x 4, and 4 x 5 respectively. The product ABC is given by 
(4;;b;,C,,). Write out in full the element of the 4th row and the 
2nd column of ABC. 


3-2 
2 Given A = 1 4), B= ial and C= ae : 
167 2 0 -1 2 


(i) evaluate (AB)C and A(BC), 
(ii) verify that A(B + C) = AB + AC, 
(iii) find the matrix M such that B? + 203 + 2M=0. 
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3 Given matrices A = (a,,) and B= (b,,) of orders m xn and 
n X p respectively, write down: 


(Ὁ the element of the ith row and the kth column of AB, 
(ii) the element of the kth row and the ith column of (AB)’, 
(iii) the elements of the kth row of B’, 
(iv) the elements of the ith column of A’. 


Show that (AB)’ = 8 4'. 


Dil 
4 Verify (AB)' = B’A’' for A = ( i) B= (3). 


1 0, 2 
1 0 0 0 
1 - 0 0 
5 If A= 1-2 1 0 
1 —3 3 -l 


show that (i) A? = J 
(ii) PQ = OP 


where P = AD,A, Q = AD,A, and D,, Dz are arbitrary diagonal 
matrices. 


6 Write down the cofactors of the elements fand k in the determinant 


a ὃ c 
de f 
g Aki. 
2 3 -ἰ 
7 Evaluate | 1 0 2 
4 2 1 


(Ὁ using cofactors of the first row, 
(ii) using cofactors of the second column, 
(iii) by the rule of Sarrus. 


3 1 —2 
8 Given that | 8 —5 7\= —35, 
4 0 1 
state, without expansion, the value of the following determinants 
3 8 4 3 —2 —2 3 1 —2 
(i) 1-5 Oj; @;8 7 #10; G@Dl4 -5ὁ. 6 
—2 #7 1 4 1 0 4 0 1 
(iv) il 1 -2} (v)|3 3 10 
—20 -5 7 8 —15 —35 
0 860 1 4 0 —5 
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2 1 --Ι 1 

1 0 2 3 

9 Evaluate 1 Δ 1 3 
—1 4 0 —1 


(i) by first forming the equivalent determinant whose elements of 
the second row are (1, 0, 0, 0), 

(ii) by first forming the equivalent determinant whose elements of 
the third column are (—1, 0, 0, 0). 


10 Find the inverse of the matrix 


by two different methods. 
11 Given the system of equations 
(i) 2x, + x, + x= 6 (ii) 2x, + Xe + X3= 6 
Xy + 2x2 + x3 = 10 Xy + 2x2 + x3 = 10 
10x, — xe +3x,= 2 7x, + 5xqg + 4x3 = 7. 


Show that one is inconsistent and find the solution of the other. 
12 Solve, if possible, the systems 
(i) X, — 3x2 + 4x3 -- x, =3 (ii) 3x, — 2x2 + X= 0 


xy -- %*3+2x,=1 2X, + Xo + 4x5 = 28 
Xy — 8x2 + 11x3 — 7x, =7 X, — 3X2 + 2x3 = 7 
X_g— 3X3 =0 4x, + x3= 3 
(iii) 2x, + 3x,= 7 (iv) x1 + 2x, + x3 — x, = —2 
4x, + 5x, = 1 3X1 — X_ + 2X3 = 2 
2x, --- X,=-—19 2X, + 5x.+ xx +x%,= 4 
2x, + 4x2 = 5 


13 Find the condition that ax? + bx + c = 0 and px? ++ qx +r=0 
have a common root. 


[Hint: Form 4 equations in 3 unknowns x?, x2, x and use the con- 
sistency condition.] 


9 
Vectors 


9.1 SCALARS 
Certain physical quantities are completely described by a magnitude, 
or ἃ measurement in a given system of units, for example, a volume, a 
mass, a temperature, or a speed. Such quantities are called scalars and 
may be manipulated using the basic laws of algebra, which for scalars 
a, B, and μ are: 


The law of commutation 


Gat p=p+a 
(ii) «B = Bau 


The law of association 


ὦ ἃ Ἐβ᾽ Ἐμξα Ἐ(6 Ἐμωὼ 
(ii) (αβὴμ = α(βμὴ = αβμ 


The law of distribution 
α(β + μὴ) = αβ + ap. 


9.2 DIRECTED MAGNITUDES 

Other quantities are not completely defined by a magnitude but require 
also a precise indication of the direction in which the measurement is 
taken. Typical examples are, a displacement, a force, a velocity, or a 
finite rotation. In the latter case, the direction is indicated by the axis 
of rotation and the magnitude by the angle of rotation. Such quantities 
may be conveniently represented on a figure by a line segment in the 
requisite direction and of a suitably scaled length to represent the 
magnitude. 

It is not necessary that these directed magnitudes have a prescribed 
point of application, for example, if a solid body moves without 
rotation in a straight line then every point of the body has the same 
velocity. Such quantities are said to be free. If a directed magnitude 
has its point of application prescribed as, for example, a given force 
acting at a point of a body then it is said to be bound. 


9.2.1 Notation 


When directed magnitudes are indicated on a diagram it is convenient 
-to use letters to label the figure and a directed magnitude may then be 


.-.--- -» 
represented by a line segment OP, OP, or OP in Clarendon type. 
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The notation OP means that the direction of the measurement is 
from O to P and that the magnitude is equivalent to the length from 
O to P. The direction in which a vector acts should be emphasized by 
adding an arrow to the line segment as in Fig. 9.1. 


Q 
/ | 
᾿ a 
ο 
Fig. 9.1 
9.2.2 Equality of directed magnitudes 


In Fig. 9.2 the two directed magnitudes OP and OR are equal, i.e. 
OP = OR, if, and only if, the lengths OP and QR are equal and they 
are in the same direction (note this means that not only are the lines 
of action parallel but that the magnitudes are directed in the same sense). 
If two directed magnitudes have the same magnitude but are in opposite 
directions, as are OP and ST in Fig. 9.2, then this is indicated by 
writing OP = —ST. In particular, OP = —PO. A null or zero directed 
magnitude is one of zero magnitude in any direction. 


fL/ 


Fig. 9.2 


9.2.3 Addition of directed magnitudes 
Suppose a man walks i kilometre due East from a given point and then 
walks in a direction N 30° E for 1 kilometre, his path may be represented 
by the displacement diagram Fig. 9.3 where O represents his starting 
point, Q his final position, and P his intermediate position. 

His path represented by OPQ is equivalent to OQ, i.e. walking +/3 
kilometre in a direction N 60° E from his initial starting point. The 
equivalence of paths may be formally written 


00 = OP + PQ (1) 
and called the triangle of addition. 
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Fig. 9.3 


It is immaterial in which order the directed magnitudes of displace- 
ment are taken because, assuming it to be physically possible, the man 
could have walked on a path represented in the figure first by the dis- 


placement OR and second by RQ. From the definition of equality, 
OR = PQ and OP = RQ and it follows that 

OQ = OP+PQ=OR+ RQOQ=PQ+O0P=OR+O0P (2) 
When the parallelogram OPQR is completed, equations (2) state that 
the sum of OP and OR is given by the diagonal OQ. This is called the 
parallelogram law of addition. Equations (2) also show that displace- 
ments obey the commutative law of addition. In the example given the 
various paths taken all lie on the surface of the Earth. The distances 
involved are so small compared to the radius of the Earth that the 
curvature of the latter may be neglected. The points represented by 
O, P, Q, and R then all lie on a plane. In comparison see example 2 of 
Exercise 9. 

However, all directed magnitudes do not combine according to the 
triangle or parallelogram law of addition. Consider for example the 
addition of two finite rotations as follows. A point P (see Fig. 9.4) is 
situated at the position whose Cartesian coordinates are (1, 0, 0) and 
the line OP is subjected to a rotation (R,) through an angle of πί2 
about the axis of Ox, and to a rotation (2) through an angle of 7/2 
about the axis of Ox,. The two rotations in the sense indicated in the 
figure can be represented by directed magnitudes OB and OA (where 
the lengths OA and OB are both 7/2). The rotation (R,) followed by 
the rotation (R,) makes P move, via Q, into S and is therefore equiva- 
lent to a single rotation about the axis Ox.. 

If, however, the order is reversed so that rotation (9) is followed by 
rotation (R;) it is seen that P moves to Q. The conclusion is that in this 
example the sum OA + OB is not the same as the sum OB + OA; the 
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directed magnitudes of finite rotations do not obey the parallelogram 
law of addition or the commutative law of algebra. 


Pe 


Ὁ (1,0,0) 
AN 
χ- 

/ 


/ 
1X4 


Fig. 9.4 


9.3 VECTORS 


A directed magnitude that obeys the parallelogram law of addition so 
that, in Fig. 9.3, 

00 =OP + OR=OR+ OP 
is called a vector. Examples of vectors are displacement, force, velocity, 
acceleration, and angular velocity. From the definition it follows that 
a finite rotation is not a vector. In addition to the notation of section 
9.2.1, a vector is frequently denoted by a single letter such as a or b 


printed in Clarendon type. The non-negative scalar magnitude of a 
vector a is then denoted by [4] or simply a in italic type. 


9.3.1 Equality of vectors 


Two vectors a and b are equal if, and only if, a = ὃ and the directions 
are the same. If a = ὁ but the directions are opposite then a = —b. 


9.3.2 Zero vector and unit vector 

A null or zero vector is of zero length and has no particular direction; 
it is frequently denoted by 0. A unit vector is one whose magnitude is 
unity. 

9.3.3 Addition and subtraction of vectors 

If more than two vectors are added together then the addition law is 


applied successively as illustrated in Fig. 9.5 until a single vector (the 
resultant) is obtained. 
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It may be easily shown, using the properties of parallelograms, that the 
order of continued addition is immaterial and that 
R=(a+b)+ce=a+(b+0oc) 


so that vectors obey the associative law of addition. The vectors of 
Fig. 9.5 do not necessarily lie in one plane. 


Fig. 9.7 


The difference of two vectors a and b is simply the sum of the vector 
a and the vector (—b). In Fig. 9.6, a is added to (—b) using the parallelo- 
gram rule. 
If (a — b) is a free vector then it may be moved to any parallel position 
and in Fig. 9.7 the sum and difference of two vectors a and b are repre- 
sented by the diagonals of the parallelogram with sides a and b. 
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(Note that the vector a — b is in the direction from the end of b to 
the end of a.) 
Example (i) If the vectors a and b are inclined at an angle 6, then 
[8 — b] = (a? + 5? — 2ab cos 8). 
and ja + b| = +/(a? + b? + 2ab cos 8). 
Solution This follows by an immediate application of the cosine rule to 
the triangles of Fig. 9.7 where angle AOB is 0. 
Example (ii) Prove the inequalities 
|a — δ] < [4] + [bj 
and |a — bj > ||a] — [bj]. 
Solution In the triangle OAB of Fig. 9.7 it is clear that 
BA <0OA+OB 


and the first result follows immediately. 
Further, if a > δ write 
OA <OB+ BA 


or BA > OA — OB, 
1.6. ja — Β] > [al — |b]; 
if a < b write 

OB <OA+ AB 
or AB > OB — OA, 
1.6. ja — b| > |b] — fal. 


The second result then follows. Obviously both equalities cannot hold 
at the same time unless either a or b is the null vector. 


Example (iii) A car ferry crossing a sea loch travels 0-5 kilometre 
in 3 minutes. The ferry is observed to move at an angle of 60° to the 
current which flows uniformly at 5 km.p.h. Find the velocity of the ferry 
relative to the water. 


Solution In Fig. 9.8(a) the velocity vy of the water compounded with the 
velocity V of the ferry relative to the water must give the resultant 
velocity ἃ in the direction AB, or 4.82. The magnitude of the velocity αὶ 
is 10 km.p.h. (given). 

The vector diagrams in Figs. 9.8(b) and 9.8(c) show that the required 
velocity in each case is u — v. If the ferry travels from A to B,, then 


V=V, = VS? + 10? + 2.5.10 cos 60°) 
= 54/7 km.p.h. 
and tan 6, = 10 sin 60°/(5 + 10 cos 60°) 


4/3/2 
ie. 6, = 40° 54’, 
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If the ferry travels from A to B, then 
V= Vz, = V(5S? + 10? — 2.5. 10 cos 60°) 


= 54/3 km.p.h. 
and tan 0, = 10 sin 60°/(5 — 10 cos 60°) 
1.6. 0, = 90°. 
Be By 


\ 
\ 


7 
Ἃ μ υ / Vv 
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Fig. 9.8 


9.3.4 Multiplication of a vector by a scalar 


If A is a scalar quantity then Aa is a vector of magnitude |Aal, i.e. |Ala 
whose direction is that of a if A is positive and that of —a if A is negative. 
The Fig. 9.9 shows a free vector a together with three vectors 2a, —3a, 
and 4a. 


ys 


If a is an arbitrary vector then a/|a| is a unit vector in the direction 
of a. A notation in common use is to reserve 4 for this vector. Thus 


Fig. 9.9 


τ ἃ or a= |ala. (3) 
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Using the properties of similar triangles and simple proportion it 
may be shown that if 2 and μ are two scalars and a and b are two 
vectors then 


μ(λα) = μλᾶ = λίμ8) (4) 
(A+ μὴ = dat pa (5) 
Aa + b) = da + Ab (6) 


Equation (4) is the associative law and equations (5) and (6) form the 
distributive law for multiplication of a vector by a scalar. 


Example Express in vector notation that (a) P is the mid-point of OA, 
(b) CD is parallel to AB, three times as long as AB, and in the opposite 
sense to AB. 
Solution (a) OP = 404A, and vector OP has the same line of action as 
OA. Hence, OP = 104. 

(b) CD -- 348 and hence CD = —3AB, the opposite direction 
being indicated by the negative sign. 


9.3.5 Components of a vector 
Given any vector r, applying the addition rule of section 9.3.3 in reverse 
it is possible to write 

r=a,ta,+...+a 
where the n vectors a, and the vector —r when put head to tail form 
a closed polygon as in Fig. 9.10. 


| 
\ 
Ι 
Ι 
Ι 
Ι' 
| 
! 


Fig. 9.10 


This polygon is, in general, a three-dimensional one. The vectors 
1, Ag, . . ., 8, are called the component vectors of τ. Any ΟἹ — 1) of 
the vectors a, may be chosen arbitrarily but the mth vector must close 
the polygon. 
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A case of particular interest arises when the vector r is expressed as 
the sum of three vectors parallel to three given non-zero, non-coplanar 
vectors a, b, and c. In Fig. 9.11, let OR represent the vector r and con- 
struct the parallelepiped with OR as diagonal and the sides OA, OB, 
and OC parallel to the given vectors a, b, and ς respectively. This 


C 


Fig. 9.11 


construction is unique, that is, only one such parallelepiped can be 
drawn and 


r=OR=0A+4AD+DR 
= 0A + 0OB+0C 
= λὰ + wb + ve 


where A, μ, and ν are determinable scalars. In particular if a, b, and ς 
are unit vectors then A, 4, and » are the number of such units in OA, 
OB, and OC respectively. The component vectors of r are Aa, ub, and 
ye but the scalar quantities A, μ, and ν are frequently referred to as the 
scalar components of the vector r or, when no ambiguity arises, simply 
as the components of r. 

The case in which the vectors a, b, and ¢ are mutually orthogonal is 
of special importance because the three directions of these vectors can 
be identified with those of the orthogonal Cartesian coordinate axes 
Ox, Oy, and Oz. Unit vectors in these directions are usually indicated 
by i, j, and k respectively. If r is a displacement vector then using 
Fig. 9.12, r may be written 


r= xi+ yj + 2k 


where (x, y, z) are the coordinates of the point R relative to the point 
O. If O is the origin of the coordinates then the displacement r is called 
the position vector of the point R. 
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If V is any vector it may be written in the form 
γεν τ V.j+ Vk 


where V,, V2, and V, are the orthogonal (scalar) projections of the 
vector V on to the coordinate axes. 
An alternative notation commonly used when the Cartesian axes are 
in the form (x, X2, Χ4) is to replace (i, j, k) by (e1, 62» eg) and then 
3 


Γ Ξε X,@; + X22 + Xseg = Σ᾽ XC; 
t=1 


and Vv = Vie, + Veo + Ves = > V ej. 
t1 


There is only one rectangular parallelepiped that can be constructed 
with sides parallel to i, j and k and with a given diagonal OR so that 
the resolution is unique and it follows that if two vectors V and U are 
equal then their components are equal. 


Ζ 


Fig, 9.12 


Thus given that V = V,i+ Κα) ἘΚ and U = U,i + U,j + U;k are 
equal, so that V = U, then 


V, = U,, “ΠΕ = U2, and V3 = Us. 


Note 1 An even further abbreviated notation is to write r = x,e;, the 
convention being that a repeated index means a summation taken 
over all possible values of that index. 


Note 2 A vector may be conveniently represented by its components 
in the form (%;, Xo, X3) OF 


x1 
Xe 
x3, 
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providing that there is no ambiguity or confusion. The vector com- 
ponents then form a 1 x 3 row matrix or a 3 x 1 column matrix 
and may be used in conjunction with other matrices. For example if 
xX = x,e, and y = y,e; and A is the 3 x 3 matrix a,; then 


y = Ax 
is the vector equivalent to the matrix equation 
y = Ax 
of example (ix) of section 8.2.6 where y™ = y’ = ( V1 Yo Va)s 
x? = x’ = (x, x2 X3) and 
41, Ayq (138 
A= βι 22 a 


431 (32 (88 


9.3.6 Direction cosines 


Suppose a general vector V is represented by PO as in Fig. 9.13 and 
that V = Vii + Vj + Vok, then by Pythagoras’ theorem 


V = |V| = PQ = (PB + BQ?) 
= (PA? + AB? + BQ?) 
= V(Vi + V3 + V2) (7) 


Fig. 9.13 
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The direction of PQ can be specified by the angles that PQ makes with 
the coordinate axes Ox, Oy, and Oz (or with PA, PC, and PD). These 
angles are usually designated «, 8, and y respectively and are called the 
direction angles. The angles are measured positively in the sense indi- 
cated in Fig. 9.13. 

It is, however, the cosines of these angles that frequently occur and 
these are designated A, μ, and » where 


A = Cos « 
= cos β (8) 
vy = cosy 


and called the direction cosines. From the right-angled triangles of 
Fig. 9.13 it follows that cos « = PA/PQ, cos B = PC/PQ, and 
cos y = PD/PQ or 


cosa = V,/V 
cos B = V2/V (9) 
cos y = V3/V 


Since the position of Q relative to P is uniquely determined by only 
3 quantities (viz. Κι, V2, and V3) and 4 quantities have been introduced 
(viz. A, μ, ν, and V) there must exist some relationship between these 
quantities. Using equations (7), (8), and (9) leads to 


2+ p2+7=1 
or cos? « + cos? B + cos? y = 1 (10) 


Instead of the actual direction cosines it is often sufficient to use any 
three quantities proportional to the direction cosines. Let these be 
1, m, and ἢ so that 


l=ki 
m=ku (11) 
n=ky 


The quantities defined by (11) are called direction ratios and are often 
written in a square bracket {/:m:n] or [/, m, n]. In this notation, [1, 2, 3] 
would be the direction ratios of a vector whereas (1, 2, 3) would be 
the Cartesian coordinates of a point in space. 


From (10) and (11) 
k= VP + m? + n?) (12) 
and hence 
A=1/JS(? + m3 + 7?) 
p= my? + m? + πὴ 
von[ JP +m +n). 
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To obtain the direction cosines from the direction ratios it is therefore 
necessary to divide by +/(/? + m? + n?). In particular, if 


V = Vii + Voj + Vk 


then [V,: V2: V3] may be used as direction ratios. 


Example (i) Find the sum of the two vectors 3i+7j —4k and 
6i — 2j + 12k and calculate its magnitude and direction. 


Solution Sum vector = R = 9i + 5j + 8k 
Magnitude = R = +/(81 + 25 + 64) 
= 4/170. 


Direction ratios are [9:5:8] and the direction cosines are (9/+/170, 


5/+/170, 8/4/170). 


Example (ii) Find the direction of the line PQ where P is the point 
(3, —7, 8) and Q is the point (2, 5, 5). 


Solution If O is the origin then in vector form 


OP = 3i — 7] + 8k 


and ΟὉ =21+ 5j+ 5k 
and hence PO = OO — OP 
= —i + 12] — 3k. 


The direction cosines of the line PQ are then 
(—1//154, 12/4/154, —3/+/154). 


Example (iii) Three forces Εἰ = 2i+ 3] + 2k, Ἐς = 2] — 3k, and 
Ἐς = 4i — 2] —k act on a point mass. Find the resultant force. 
Calculate the angles it makes with the coordinate axes. 


Solution Let the forces act on the point mass at the point P in Fig. 9.14. 


Ρ 
Fig. 9.14 


The resultant R = F, + F, + F, 
= 6i + 3j — 2k. 
The magnitude of R is +/(36 + 9 + 4) or 7. 
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The direction cosines are (6/7, 3/7, —2/7) and the direction angles are 
a =cos-*(6/7) = 31°, 
B = οος" 1 (3.7 = 64° 37’, 

and y = οοβ" 3 (—2/7) = 106° 36’. 

(Note here the value of y corresponding to the negative sign for cos y.) 


Example (iv) Prove that the centroid of any triangle divides the median 
in the ratio 2:1. 


a 
Fig. 9.15 


Solution In Fig. 9.15 let M and N be the mid-points of AC and AB 
respectively of triangle ABC and let G be the centroid. 
Let CA = band CB = a, then BA = b — a, BN = }(b — a), CM = 3b. 
Let CG = «CN where « is a scalar, then 
CG = «(CB + BN) 
= a(a + 3(b — a)) 
a 
= 5 (a + bd). 
Let BG = ΒΜ, then 
BG = f(BC + CM) 
= β(--ἃ + 4b). 


Now since BG = BC + CG = —a + CG, it follows that 
x 
B(—a + $b) = —a + 5 (a +b) = R (say). 
The vector components of R are unique and hence 
a 
—B=-1+ 5 
β 


and 2" 


NIR 
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from which it follows that 
a= Bp = 3. 


The result then follows immediately. 


9.3.7 The transformation matrix 


The transformation matrix of section 8.1 can now be given a geo- 
metrical significance. Suppose that P is a point that has Cartesian 
coordinates (b,, bz, bs) when referred to a frame of axes Ox,x2x3 and 
has coordinates (B,, Bo, Bs) when referred to a second frame Oy, yoys 
obtained from the first by a rotation about O as illustrated in Fig. 9.16. 


ρ (by, ba, bg) 
@ (8), Bz, Bs) 
ai 


x2 


[ 
{ 
\ 
Fig. 9.16 
From sections 9.3.5 and 9.3.6 it follows that 
B, = sum of projections of b,, bz, and bg on to Oy, 
= δι cos (x,Oy;) + be cos (x20y1) + ὃς cos (xsOy1) 
= 4,b, + Habe + 1b; 


where [A,;, μιν» ¥;] are the direction cosines of Oy, referred to the frame 
Οχιχοχᾳ. Likewise 


Bz = λεδι + Hebe + ναῦς 
and Bs = A3b, + μεθ. + νοῦς 


where [Az, fa, να] and [A3, 443, ¥3] are the direction cosines of Oy, and 
Oyg respectively. 
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In matrix notation 
By Ay fy νὼ (ὃ. 
B, = he be Vo be . (13) 
Bs As μα a) \bg 
In an exactly similar manner a relationship can be derived in the form 
by Ay Ao Ag 8. 
ba} = μι μὲ Ms }[ Be (14) 
bs » Vg V3 B,, 
where [A,, A, Ag] are the direction cosines of Ox, referred to the frame 
Oyxyay3- In a more concise notation the equation (13) may be written 
B, = ajjb; 
or B= Ab 
where A = (αι) is the transformation matrix and 
ἄμ = Cos (x,Oy,). 
Similarly equation (14) may be written 
δι = a,,B, 
or b= 4Ἐ 
where A’ is the transpose of A. 


P (by, 62, bs) 


Fig. 9.17 


Thus the transformation matrix (a@,,) represents the rotation of the 
frame of reference Ox,x,x3 into the frame of reference Oy, yzy3 and 
has elements which are direction cosines of one frame with respect to 
the other, i.e. it specifies the magnitude and direction of the rotation. 
Alternatively, if the frame Ox,x2xz3 is fixed then (a,,) represents the 
rotation in space of the line OP through an angle 9, to take up a 
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position OQ where P is the point whose coordinates are (b,, bo, b3) 
and Q is the point whose coordinates are (B,, Bz, B3) and OP = OO. 
This is illustrated in Fig. 9.17. 

Now let x be a vector in the direction of OP, i.e. x = tb, and let y 
be a vector in the direction of OQ, i.e. y = sB, where ¢, s are scalars, 
then 


y= Cx 
where C = (ci) 
5 
=F (ai). 


The matrix C contains all the information necessary to give the relative 
positions and magnitudes of the vectors x and y. 

Note that solving equation (13) for b leads to b = A~1B and hence 
it follows that 


A’=Aq} 
or AA’ =I. 


Matrices such as A which satisfy this relationship are called orthogonal 
and have an important role in matrix application to engineering prob- 
lems, particularly in connection with those involving vibration. 
Note also that in suffix notation the equation y= Cx may be 
written 
Vi = ἐμχ; 


where (x,;) and (y,;) are column matrices of order (3 x 1) and (c;,) is 
necessarily a square matrix of order (3 x 3). 


9.4 VECTOR EQUATION OF A STRAIGHT LINE 

The direction of a straight line in space is determined by its direction 
cosines (or ratios) and its position is fixed in space if the coordinates 
of any point on the line are known. 


Fig. 9.18 
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Suppose a straight line has a direction specified by a constant vector 
b (i.e. in a Cartesian coordinate system by Ὁ = δι + b,j + δ5Κ so that 
the line has direction ratios [b,, bs, b3]) and passes through a fixed 
point A in space whose position vector relative to an origin O is a (1.6. 
in Cartesian coordinates a = a,i + a,j + ask so that A has coordinates 
(αι, 45, 43)). In Fig. 9.18 the points A and B are given (for a given a 
and b) and the straight line is obtained by producing the line segment 
AB in both directions. 

The equation of the line is obtained by stating mathematically the 
geometrical condition that the variable point P is constrained to lie on 
AB and on AB (or BA) produced. Let OP = r be the position vector 
of P, then in Fig. 9.18 


OP =r = OA + AP. (15) 
But, from section 9.3.4 
AP = rAB 
= tb (16) 


where ¢ is some scalar. 
For fixed b, different values of 1 give different points along the line. 
From equations (15) and (16) the required equation is 


r=a+ tb. (17) 


It is important to note that the vector equation (17) is independent of 
any system of coordinates. The scalar quantity ¢ is a parameter that can 
range from —0oo to - οὐ. If t> 1, P lies in AB produced; if ¢ = 1, 
P coincides with B; if 0 <t< 1, P lies on AB; if t = 0, P coincides 
with A and if t < 0, P lies on BA produced. 

In a rectangular Cartesian coordinate system 


r= xX,i + χοὶ + xk, 
a= ai + a,j + ask, 
and b = b,i + boj + bak 
so that equation (17) becomes 
χιὶ + χαοὶ + xk = (Qui + Goj + aak) + t(b,i + δε] + Sk). 
From section 9.3.5 this is equivalent to three scalar equations 
X, = a, + th; 
Xo = do + the (18) 


ΧΑ = a3 + ths. 


The set of equations (18) are called the parametric or freedom equations 
of the line. Rearranging (18) gives the usual form of constraint equation 
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of the line 
ee ey (19) 


It is worth noting that if the line lies in the two-dimensional plane 
Ox 1X, then a, = 0 and, since the line is at right angles to the x-axis, 
bs = 0 so that equation (19) reduces to 


xy ye αι Xo pa ag X3 


Be he Oe σῳ 
The equations (20) are equivalent to 
ΣῈΞ es a, (21) 


δ. be 
If (x1, χα, Χ9) is replaced by (x, y, z) the equation (21) becomes 
y — ag = bx — a)/b,,z = 0 
which is equivalent to the well-known form 
y=emxt+ce, z=0. 
Example (i) Find the equation of the line passing through the two 


points R and S whose Cartesian coordinates are R(3, 2, +1) and 
S(6, —3, 4). 


Fig. 9.19 q 


Solution The direction of the line is specified by the join of the two 
points. In the notation of Fig. 9.19 


4 -- 3: Ὁ 2] ἘΚ, 
r= Xi + χα} + Xgk 


and ς = 6i — 3] + 4k 
so that RS =b=c-—a 
= 31 — 5] + 3k. 
The vector equation of the line is then in the form 
r=a+tb 


which, using equations (17) and (19), leads to 


X,—-3 χο-- 2 x-~1 
Ἐπ τες CO) (22) 


204 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


Note that the vector equation could have been written using the point 
S instead of the point R, the equation would then have taken the form 


r=c+sb 


where s was the parameter. The equation equivalent to (22) would then 
have been 


fe as τ ϑο 23) 


That equation (23) is the same as equation (22) is immediately evident 
if s is replaced by t — 1. 


Example (ii) Find the equation of the line through (0, 3, 2) and parallel 
to the join of (4, 3, 1) to (10, 3, —2). 

Solution The join (4, 3, 1) to (10, 3, —2) has direction ratios [6:0: —3]. 
Write a = 3j + 2k and b = 6i — 3k then the vector equation is 
r =a + tb which leads to the scalar equations 


Note that the zero in the direction ratio shows that the line is perpen- 
dicular to the axis of y. Each point of the line has a y-coordinate of 3. 


9.5 THE POSITION RATIO WITH REGARD TO TWO 
BASE POINTS 
Given two points A and B the position ratio gives the coordinates of 
the point P that divides the join of A to B internally in the ratio p:q. 
A Pp 8 


Fig. 9.20 


In Fig. 9.20 let O be the origin and let OA = a, OB = b be the position 
vectors of the given points A and B. Let P divide the join A to B in 
the ratio p:q then 

AP/PB = p/q 
or AP/AB = pi(p + 9) (24) 
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In vector form, 
AB=b—a 
and AP = ἐρίῳ + 4)}0 — a). 
The position of P is determined by the vector OP or r where 
r= OA + AP =a + {p/(p + g)}(b — a) 
= (pb + ga)/(p + 9) (25) 


If a= x,i+ γι} + 2k; b = χροὶ + yoj + zk; and r= xi+ yj + zk 
then equation (25) leads to three scalar equations for the coordinates 
of P, namely, 
‘i PAA - Pat qM een ol Ὁ 42ι 
pre: pt+q? pt+q 
If the join A to B is divided externally in the ratio p:q the equation (24) 
is replaced by 


(26) 


AP/AB = ρίῳ — 9) 


and equation (26) is replaced by a similar equation with (—q) instead 
οὔ. 
Example Find the coordinates of the point dividing the join of (1, 2, 3) 
to (5, 7, 8), (a) internally in the ratio 3:2, and (b) externally in the ratio 
5:7. 
Solution 
(a) With (41, y1, 21) = (A, 2, 3) and (x2, yo, Z2) = (δ, 7, 8) the point has 
coordinates 

3.5+2.1 17 


fem Fe et OR 
3.74+2.2 
ae 342 “-5 
pnb 8H2.3_, 
3+2 


(Ὁ) Using p:q = 5:—7 gives the coordinates (—9, —21/2, —19/2). 


9.6 PRODUCT OF TWO VECTORS 


So far only the addition of vectors and the multiplication by a scalar 
have been considered, but any vector algebra is lacking unless some 
meaning can be attached to the product of vector quantities. It is 
evident that the ordinary ideas of (scalar) multiplication cannot be 
sufficient because of the directional properties possessed by vectors. In 
order to give a motivation for the subsequent definition of the product 
of two vectors, consider now the multiplication of a force vector Fanda 
displacement vector d. Suppose for simplicity that the force F is constant 
(i.e. constant in both magnitude and direction) and that it acts at a 
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point P as in Fig. 9.21. Consider the work done by the force as the point 


P moves to a point Q where the displacement vector PQ is denoted by d. 
Let the angle between F and ἃ be 6, then the work done by F is defined as 
the product of the displacement and the force component in the direc- 
tion of the displacement, i.e. (F cos θ)ά. This scalar quantity Fd cos @ 
has been arrived at by forming, in a particular way, the ‘product’ of the 
two vectors F and d. 

As a second illustration of the product of two vectors, suppose that 
the force F acts at one end Q of a rigid rod PQ whose other end P is 
freely pivoted. Consider the moment of the force about the point P. In 
Fig. 9.21 the magnitude of this moment is defined as the product of the 
force and the length of the perpendicular from P to the line of action of 
F, ie. F(d sin 0). The moment of the force has, however, a direction 
associated with it as well as a magnitude and is a vector quantity. The 
direction of the moment of F about P is perpendicular to the plane of 


Fig. 9.21 

F and d and in the sense of the longitudinal movement of a right- 
handed screw laid in that direction and rotated in the direction that F 
is attempting to turn PQ about P. In Fig. 9.21 if F and d are in the 
plane of the paper then the moment about P is perpendicular to the 
paper and directed away from the reader. Hence it is possible to arrive 
at a vector quantity by forming, in a particular way, the ‘product’ of 
two vectors F and d. 

9.6.1 The scalar product 

Given two vectors a and b the scalar product (sometimes called the dot 


product) is defined as the scalar quantity ab cos 6, where θ is the angle 
between the two vectors and is usually written a . b, i.e. 


a.b = αὐ οος 9 
= a(b cos θ) 
= b(acos 6) 
ΞΕ". ἃ. (27) 
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Note that if a and b do not lie in the same plane then θ is the angle 
between two vectors parallel to a and b respectively drawn through 
any given point. The scalar product may be positive or negative 
depending on the angle 6. If 6 =0 the vectors are parallel and 
a.b = ab, in particular 4. 8 = a? = a’. 

The scalar product may be used to find the projection of a vector 
in any given direction. In Fig. 9.22 the scalar projection of the vector 
b in the direction of a is PN, and from equation (27) 


abcos@ a.b 


PN = bcos§ = =—. (28) 


Fig. 9.22 


: ΕΝ . b.n P 
Likewise the projection of b in any direction n is oer Note particu- 


larly that b and n do not have to be coplanar vectors. The scalar product 
obeys the following laws of algebra: 


The commutative law, a.b=b.a (29) 
The associative law, a(a.b) = ca. b=a.ab (30) 
The distributive law, a.(b +c) =a.b+a.c. (31) 


If equation (31) is divided by a then the distributive law states that the 
component of (Ὁ +c) in the direction of a is equal to the sum of 
the component of b in the direction of a and the component of ς in the 
direction of a. The distributive law may be extended in the form 


(a+b).(c+d)=a.c+a.d+b.c+b.d. (32) 

From equation (27) it is easily seen that the scalar product a . b vanishes 

not only when either a or b is zero but also when cos θ is zero. Thus, 

two non-zero vectors whose scalar product vanishes must be perpen- 

dicular. In particular if i,j, and k are the unit vectors along the ortho- 

gonal Cartesian coordinate axes then 

i.i=j.jok.k=1 (33) 

and i.j=i.k=j.k=0. (34) 

It follows immediately, using equations (32), (33), and (34), that if 
a = ai + a,j + ask and b = b,i + b,j + 53k, then 

ἃ «Ὁ = (ai + a2j + ask). (δι1 + δε] + dak) 
= a,b, + Agde + Agb3. (35) 
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Example (i) Find the scalar product of a = 2i— 3j+ 7k and 
b = 3i + j — 5k. 
Solution Here, 
a.b=6—3— 35 
using equation (35), i.e. 
a.b= —32. 
Example (ii) Find the projection of a = 2i — 3] + kon tob =i + 5k. 
Solution From equation (28) the projection of a in the direction of b 
is a. b/b which is 
2+0+5 “ἢν 
Laas Re 
Example (iii) Find the angle between a = 2i — 3] +k and b= 
3i + 5j — 2k. 
Solution From equation (27), 
cos 6 = a.b/ab 
(6 — 15 — 2)/(v/14)(V/38) 
—0.4768 
and θ = 118° 29’ = 180° — 61° 31’. 
Note that in any given problem it may be sufficient to find the acute 


angle between the two vectors, i.e. to ignore any negative sign. Strictly 
the angle of 61° 31’ in the example above is the angle between a and —b. 


Example (iv) The point of application of the force 10i + 7j + 8k 
newton moves a distance of 3 metre in the direction of (a) the vector 
2i + 5j + 3k, and (b) the vector i+ 2j — 3k. Find the work done by 
the force in each case. 


Solution Let the force be F and the displacement be d then the work 
done in each case is F. d. 
(a) The unit vector in the direction of 2i+5j+ 3k is 
ΟἹ + 5] + 3k)/+/38, therefore 
= 3(1 + 5] + 3k)/+/38 
and the work done by the force is 
W =F.d = (10i + 7] + 8k). ΑἹ + 5] + 3k)(3/+/38) 
= (20 + 35 +°24)3/4/38 
= 237/4/38 joule 
(b) The displacement vector in this case is 
d = 30 + 2j — 3k)//14 
and the work done is 
W=F.d = (10 + 14 — 24)(3/4/14) 
= 0 joule. 


_ 
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No work is done by the force when it moves in a direction at right 
angles to its line of action. 


Example (v) Show that the angle in a semi-circle is a right angle. 


Solution It is required to show that in Fig. 9.23 AP and BP are at 
right angles for any position P on the given circle centre O, diameter 
48. 


Α Z B 
Fig, 9.23 ; 
Write OB =a and OP =b, then AO =a and AP = AO+ OP 
=a-+b, and BP = BO + OP = —a+b. 
The scalar product 
AP. BP = (a + b).(—a + b) 
—a.a+b.b 
== f? — q? 
== 0, since OP = OB. 
The scalar product is zero and since AP and BP are non-zero they are 
at right angles. 


Ι 


9.6.2 The vector product 
Given two vectors a and b the vector product (sometimes called the 
cross product) is another vector ¢ written ¢ = a x b such that 


(i) ¢ is at right angles to both a and b, 
(ii) |c| = ab sin 6, where @ is that angle between a and b such that 
0<60<7z, 
(iii) a right-handed rotation about ς through an angle 6 would move a 
into the direction of b. 


The vector product may also be written 
aX b= (absin θ)η (36) 


where ἢ is a unit vector, [πη] = 1, in the direction of ς (Fig. 9.24). 
These properties imply that the vector b x a is identical with —c, or 


bx a= —(a x b). (37) 
The vector product does not obey the commutative law of algebra and 


it is essential in any manipulation of vectors involving the cross product 
that the order of the vectors is strictly maintained. 
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The vector product a x b is zero when either of a, b is zero or when 
sin 9 = 0. Thus two non-zero vectors have a zero vector product when 
they are parallel. 

If a and b are at right angles then a x Ὁ = abn. The base unit vectors 
i, j, k, of a right-handed Cartesian coordinate system are mutually 
orthogonal and have the following properties 


ixj=-jxi=k (38) 
jxk=-kxj=i (39) 
kxi=-ixk=j (40) 
ixi=jxj=kxk=0. (41) 


Although the vector product does not obey the commutative law it 
may readily be seen that it can be associated with a scalar « in the form 


a(a x b) = (wa) X b= a Χ (ab). (42) 


Fig. 9.24 


The vector product satisfies the distributive law in the form 
ax(b+c)=axb+axe (43) 


but the proof of this law is not obvious. (For a geometrical proof the 
reader is advised to consult a book devoted to vector analysis.) 
The Cartesian form of a x b may be derived as follows. Suppose c 
of Fig. 9.24 represents a x b and the Cartesian forms of the vectors are 
a= αιἷ - aj+ ask 


b = byi + baj + bok 


with c=a x b= absin On = ci + coj + cok. (44) 
Thus ς = (ai + aj + ask) x (6,1 + δα] + δ51) 
= Gi X (b,1 + Boj + δ5Κ) 
+ a,j Χ (bri + δα) + bak) (45) 


+ ask X (bi + δι] + bsk) 
using the distributive law (43). 


τα 
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A further application of (43) and the use of equations (38)-(41) leads 
to 
c=a X b = (agbs — agbo)i + (asb1 — aibs)j + (αιδα — G2bi)k (46) 
or, in determinant form, 
ij k 
a, Gg a3}. 
δι bz bs 


axb= (47) 


Example (i) Find the vector product of a= 3i+ 2j—k and 
b=i— 4j — 2k. 


Solution Here, 


ij k 
axb=|3 2 —1|=—8i+ 5j— 14k. 
i ae 


Example (ii) Find the equation of the straight line passing through the 
point (3, 4,5) having a direction perpendicular to the lines whose 
direction ratios are [1:0:2] and [—1:2:3]. 


Solution The direction of the line is to be perpendicular to the vectors 
a =i+ 2k and b= —i-+ 2] + 3k. This direction is parallel to the 
vector 


i Ι κα 
axb= 1 O 2)= —4i — 5] + 2k. 
-1 2 3 


The equation of the required line is given in vector form by 
τ = Οἱ + 4j + 5k) + 1(-- Δ — 5j + 2k) 
where r = xi + yj + zk, or in scalar form by the equations 
CS γ-4 2--5 


9.6.3 The matrix representation of a x ὅ 
Let axb=c=ce; 


then the components of a, b, and ¢ all form (1 x 3) row matrices. It 
is of interest to note that the vector product may be expressed in a 
matrix notation by 


0 --Ξ be 
(Cy Cz €3) = (Ay 44 (3) bs 0 —b,]. 
—be ὃ, 0 
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This may be further reduced into a concise matrix notation given by 
Ci = ἐμ ἰδ 
in which i, j, and k are suffixes that take all possible values 1, 2, and 3 
and e,,, is defined as follows, 
€i;, = 0, if any two suffixes have the same value, 

= +1, if the suffixes have the cyclic order 123, 231, or 312, 

== —1, if the suffixes have the acyclic order 132, 321, or 213. 
For example, 

Co = €2j, a;b, (summed over all values of ), k) 

= €931 4581 + 6215 αχθᾳ + terms in which δ; is zero 

= (+1)asby + (—Dayb, 

= εὖ, — aybz. 
Note that the suffixes i, j, and k are simply symbols that take the values 
1, 2, or 3 and have no connection with unit vectors i, j, and k. 


9.7 THE PLANE AND THE STRAIGHT LINE 

A plane is a surface in space for which the join of any pair of points 
of the surface is a straight line lying entirely in the surface and perpen- 
dicular to a given direction known as the normal to the plane. 


Fig. 9.25 


Suppose, as in Fig. 9.25, that A lies in the plane and that the position 
vector of A is OA = a. Let P be a variable point such that OP = r. 
Let ς be the vector giving the normal direction, then the join of A to P 
lies entirely in the plane passing through A if AP is perpendicular to ς, 


Ἰὰ AP.c =0 
or (r—a).c=0, (48) 
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Equation (48) is then the equation of the plane and is a scalar expres- 
sion. 


In Cartesian form, if 
a= a,i + aj + ask, 
ς = li+ mj + nk, 
and r= xi+ yj + zk, 
then the plane equation has the form 
I(x — αὐ + my — ae) + -n(z — as) = 0 (49) 


From equation (48), the equation of any plane whose normal is in the 
direction of ς has the form 


r.c= 8 (50) 
or Ix + my + nz = β. (51) 
In another notation the plane equation has the form 
Qy1X1 + AyoX2 + ἀιοχᾳ = by 


and is a linear, algebraic equation of the type discussed in Chapter 8. 
Geometrically, a plane in space will usually meet a second plane in a 
common line of intersection. This line will then meet a third plane in 
a single point. Thus three planes will in general meet in a single point 
of intersection. The discussion of section 8.7 concerning the solution of 
n linear, algebraic equations in n unknowns has a simple geometrical 
interpretation when n = 3. The regular solution of section 8.7.1 is 
equivalent to the case of three planes meeting in a unique point of 
intersection. The equation (54) of section 8.7.1 is simply the case of 
three planes through the origin. The inconsistent case of section 8.7.2 
is geometrically equivalent to the case of three planes whose three lines 
of intersection form the edges of a triangular prism, i.e. the lines are 
parallel. The indeterminate case of section 8.7.2 is equivalent to the 
three planes having a common line of intersection. 

Many problems concerning the straight line and the plane can be 
dealt with using vectors and a few problems are solved below. 


Example (i) Find the equation of the family of planes whose normals 
are in the direction of the vector 2i — 3] +k. Find that member of 
the family passing through the point (3, —1, 2). 


Solution From equation (51) the family equation is 
2x —3y+z= 6 
The particular plane through the point (3, —1, 2) has the value of β 
given by 
26) -- 3(-1I) + ὦ =8 
that is B= 11. 
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The required equation is then 
2x —3y4+z=11. 


Example (ii) Find the equation of the plane containing the line 


Solution The plane must pass through the point (3, 1, —1) and have 
its normal perpendicular to the directions [2:1:3] and [1:3:—1). If the 
normal vector is ¢ then 


i j k 
ςσΞ [2 1 3 | = —10i + Sj + 5k 
1 3 —1 


and the equation of the plane is 
—10(x — 3) + 50 -- ἢ +5 +1) =0 
or 2x—-y—-z=6., 


(Note that the point (1, —2, —4) is not used in determining the equa- 
tion.) 


Example (iii) Find the coordinates of the point of intersection of the 
line r = a + tb with the planer .c = β. 


Solution A straight line meets a plane in one point, in general. The 
point will be determined when the line and plane have a common 
position vector, i.e. when 

(a+ tb).c= 8 
or ἃ. Ὁ tb.c) =f. 


The value of ¢ is then given by 


Oe Sek ae 
t= Bc if b.c 0, 


and substitution into the equation r= a+ tb gives the required 
position vector. 

If b. c = 0 the line is perpendicular to the normal, i.e. parallel to the 
plane, and there is no point of intersection (except at infinity) unless 
B also has the value a.c (which implies that a lies on the plane) and 
then ¢ is indeterminate, the line lies in the plane, and every point of the 
line is a solution. 


- 
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Example (iv) Show that the lines 


x—-5 y-1 z—2 x+9 y—6 244 
SF eae ee, Sgt Ξτ 


intersect and find the coordinates of the point of intersection. 


Solution Usually two lines in space are skew (i.e. they do not intersect) 
and the condition for intersection is simply that they have a point in 
common. 

Let the parameter of the first line be ¢ and that of the second line 
be s, then the condition that there is a common point is the condition 
that the equations 

5 Ἔ 2t= —9 + 4s 
14+ t= 6-—2s 
2+3t=-—4+4+ 5s 


are consistent. The condition for consistency from equation (56) of 
section 8.8.2 is that 


14 2 4 
-5 1 —2}=0. 
6 311 
The reader can easily verify that this is so and also that ¢ = —1 (or 
s = 3). Using the first equation with t = —1, the intersection point is 


given by x = 3, γ = 0, andz = —1. 
Example (v) Find the shortest distance between the lines 


Fig. 9.26 


Solution In Fig. 9.26 the shortest distance is given by RS the common 
perpendicular. The direction of the perpendicular is given by n where 
n = b x B, and b and B are vectors in the direction of the lines. 
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Let P and Q be points on the lines with position vectors a and A 
respectively, then the length RS is the projection of PQ on to n. Since 
PQ = A — athen 

_|A—a).n 
> n 
Here, a=3i—j+4k, b=2i+j+k, A= -- -- 2]  Κ, and 
B = 3i — j — 3k, hence 


RS 


ae (ee = oe 
n=bxB=/2 1 1 
3, τὴν eB 
= —2i + 9j -- 5k. 


The shortest distance is 
(--Τὶ — j — 3k) .(—2i + 9j — 5k) 
V/(4 + 81 - 25) 


9.8 MOMENT OF A VECTOR ABOUT A POINT 

If a force acts on a particle at a point Q then the moment of the force 
about a point P is d x F where ἃ is the position vector of Q relative to 
P as in Fig. 9.27. The direction of the moment is perpendicular to the 
plane of d and F and in the right-handed sense. The magnitude of the 
moment is Fd sin 6. 


= 20/4/10. 


P 


Fig. 9.27 


Example A force of magnitude 4 newton acts at the point (3, 1, ~4) 
parallel to a line whose direction ratios are [—1:1:—1]. Find the 
moment of the force about the point (1, 0, —2). Distance units are in 
metres. 
Solution Write a =i — 2k, b = 3i+ j— 4k. The force vector is 
F = 4(—i + j — k)//3 newton. The moment Μ is given by 
M = (b—a)xF 
= (i + j — 2k) x (-i+ j — HG/v3) 
4 i jk 
ΞξΞ --ὁ 2 1 -2 
he το ee 


= 44 + 4j + 3k)/+/3 newton metre. 


- 
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It may be noted that M has three components (4/+/3), (16/+/3), and 
(12/4/3) respectively, in the directions of the coordinate axes. 

The concept of the moment of a force about a point can be extended 
to any vector quantity although the extension may not always have any 
physical significance. For example, if v is the velocity vector of a 
particle of mass m at the point P and r is the position vector OP then 
my is the linear momentum vector and r x mv is the moment of 
momentum (or angular momentum) of the particle about the origin O. 
This quantity is fundamental in mechanics. 


9.9 THE PRODUCT OF THREE VECTORS 
The product of two vectors is either a scalar or a vector quantity and 
in each case may be multiplied by a third vector. 

Given two vectors b and ς, the product b.c is a scalar, A say, and 
may be multiplied by the third vector a to give (b.¢)a or da which is 
simply a vector of length Aa in the direction of a. The product of a 
with the vector product b X cis of more interest and has many applica- 
tions in mechanics and other fields. 


9.9.1 The triple scalar product 


The product of a with b x ¢ may be evaluated in the form a. (Ὁ Χ c) 
and the result, being a scalar, is called the triple scalar product of a, b, 
and ς. Let a, b, and ς be drawn at a point P and the parallelepiped 
constructed as in Fig. 9.28. 


Fig. 9.28 


Let θ be the angle between b and ς, and ¢ the angle between a and the 
normal to the plane containing b and c. By definition 


b x ς = (δε sin 6)n 


where n is a unit vector in the direction of the normal to the plane 
containing b and ς in the right-handed sense with respect to b and c, 
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The triple scalar product is given by 
a.(b X c) = a.(bcsin )n 
= (ὃς sin 6)(a. n) 
= (bc sin 9)(a cos φ) 
= (bc sin 0)PN 


where PN is the ‘height’ of the parallelepiped. Since bc sin θ is the area 

of the ‘base’ parallelogram PBDC, the triple scalar product may be 

represented as the volume of the parallelepiped with sides a, b, and c. 
If 


a= ai + Adj + ak, b = διὶ + δε] + b3k and c= Ci + Coj + C3k, 
then 

ij k 
δι by bs 
α Cg (8 


bxc= (52) 


and 

ἃ. (Ὁ Χ ὦ = (αιἱ + aaj + ask). (44 + Aaj + Ask) 
where A,, 42, and Ag are the cofactors of i, j, and k in determinant (52). 
Evaluating, 


Qa, ag ag 
δι be bg |. 
Cy Cg (8 


4. Χο) Ξ (53) 


From the properties of interchange of rows in the determinant (53) 
it follows that 


a.(b Χ c)=b.(c X a) = Cc. (a x 5). (54) 
The commutative property of the scalar product shows that 
ἃ. X c) = (a x ").ς. (55) 


The brackets in (55) are frequently omitted and then 


a.bxc=axb.c 


which shows that the . and x of the triple scalar product are com- 
mutative. Notations in common use are (a, b, 6) or [a, b, 6] or simply 
abc. Although the brackets are frequently omitted, the reader is advised 
to retain brackets until confidence has been gained in vector manipula- 
tion. Note that a quantity such as (a.b) x ¢ is meaningless because 
(a. b) is a scalar quantity. 

From the property of the determinant (53) 


(a, b, c) = —(a, ς, b). 
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The triple scalar product vanishes 
(i) if any of the vectors is zero, 
(ii) if, in Fig. 9.28, cos φΦ = 0, ie. if φ = 7/2, 
(iii) if any one vector is a scalar multiple of either of the other vectors. 


This follows.immediately from the property concerning equal rows 
in the determinant (53). 


From (iii) the triple scalar product vanishes if any two of the vectors 
are parallel. From (ii) the necessary condition that three non-parallel 
vectors are coplanar is that the triple scalar product vanishes. 


Example (i) Find the triple scalar product of the vectors 2i — 7j + 3k, 
i — j, and 3i + 4k. 


Solution The triple scalar product is 


77.3 
1 —1 0]=3@) +45) - 29. 
3. 0. 4 


Example (ii) Find the condition that the two lines r = a + rb and 
r= A + SB intersect. 


Solution An inspection of Fig. 9.26 shows that the lines will intersect if 
the vectors (a — A), b, and B are coplanar. The condition for this is 
simply 

(a — A).(b x B) = 0 
which is 


9.9.2 The triple vector product 
The product of a with b x ς may be evaluated in the form a x (ἢ Χ c) 
and the result, being a vector, is called the triple vector product of 
a, Ὁ, and c. With reference to Fig. 9.28, 
b xX c= (bc sin 6)n 
and a X (Ὁ Χ ὦ = (bc sin O)(a x n) 
= (bc sin 6)(a sin 6)m (56) 

where m is a unit vector perpendicular to a and n in the right-handed 
sense. This vector m, and hence a x (Ὁ x o), is in the plane of b and ς 
and from Fig. 9.11, with a a null vector and OR in the plane of b and ς, 
it follows that 

ax (bx c) = μὴ + re (57) 


where yz and » are scalars. 
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Likewise, 
(a ΧΡ) X c= —c x (a x dD) 
= aa + fb (58) 
where « and β are scalars. 

The results (57) and (58) show that, in general, a x (Ὁ Χ οὐ and 
(a x b) x ς are different vectors. The position of the bracket in a 
triple vector product is vital to the definition and must be retained. The 
quantity a x b x cis not fully defined. 


To find the value of μ and » of equation (57) first form the scalar 
product of both sides with the vector a, i.e. 


a.f{a Χ (Ὁ X ©} = μί(. b) + r(a.0) (59) 
The left side of (59) is a triple scalar product [a, a, (Ὁ x ¢)] with two 
equal vectors and is therefore zero. Hence, 


= = 1 (say) (60) 


and substitution in (57) gives 
a X (b X c) = ¢{(a. c)b — (a. b)c}. (61) 
The equation (61) is to be true for all vectors a, b, and ς. Suppose 
a= Qi + aj + ask 
b = δι1 + bj + bck 
and ς = αἱ + Coj + cok, 
then b Χ ¢ = (bgc3 — bgca)i + (b3¢, — δι64}} + (Bice — becy)k 


and the component of a x (Ὁ x c) in the x-direction is 


A{b1Cz — θ201) — ag(bsc, — by C3). (62) 
The component of (a. c)b — (a. b)c in the x-direction is 
(στον + Aoeg + ἀ569)8. — (ayb1 + debz + agbs)e, (63) 
Comparison of (62) and (63) shows that in (61) 
t=1 
and that finally 
a Xx (b X c) = (a.c)b— (a. bode. (64) 


Example (i) Verify the result (64) ifa = 2i-j+k, b=i-—j, and 
e=i+jt+k. 


Solution Here, 


i j k 
bxe=|1 -1 O|]=-—i-—j+2k. 
1 1 1 
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The left side of (64) is 


ij κ 
ax(bxc)=| 2 -1 1|=—i-5j—3k. 
= ee Gee 


With (a.c) = 2 and (a.b) = 3, the right side of (64) is 
(a.c)b — (a. be = 20 — ἢ — 30 4+j +h 
= —i— 5] — 3k 
which is identical with the left side. 


Example (ii) Given x X a = band x.c = «, find x. 
Solution The triple vector product of c, x, and a is, from (64) 
c X (x X a) = (c. a)x — (ς. x)a 
= (a.c)x — aa. 
Put x Χ a= b then 
c xX b= (a.c)x — aa 

and so x= ene . 
Example (iii) Prove that (a x b).(c x d) = (a. e)(b. d) — (a. δῶ. ὦ. 
Solution First consider (c x d) as a single vector V, say, then 

(ax b).(c Χ d)=(ax b).V 

= (a, b, VY) 

a.(b x V) 
a.(b x (εχ d] 
a.[(b. d)c — (b. c)d] 
(a.c)(b.d) — (a. d)(b.c). 


Ι 


9.10 DIFFERENTIATION OF VECTORS 


In most physical applications the vectors involved are not constants 
but are variable with respect to some scalar quantity. This scalar is 
frequently the time. For example the velocity v of a rocket being pro- 
jected from the Earth is clearly a function of the time since blast-off. 


9.10.1 Differentiation with respect to a scalar 
Suppose a is a vector which is a function of a scalar wu, this is written 


a=f(u) or a(u). (65) 


Let u change to u + du such that a changes to a + da where a + da 
is a(u + du). The derivative of a with respect to u is defined in the same 
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way as for the scalar functions of Chapter 2 by the limiting value of 
the ratio of da to du and written 


δᾶ . a(u + du) — a(u) 
qi Lim 5, = Lim (66 
The ratio of da to du is in the form of a scalar (1/6u) multiplying a 
vector da and is a vector in the direction of da. In Fig. 9.29, OP repre- 
sents the vector a and OQ the vector a + da so that PQ represents the 
vector da. 
In general the vector da is in a different direction from that of the vector 
a. Write PQ = PQs where s is a unit vector in the direction of PQ, then 


Q q 
iP 
ατδα ‘a. 
oO 
Fig. 9.29 Fig. 9.30 


the limit process of (66) involves both the limit of the magnitude ratio 
(PQ/éu) and the limit of the direction of the vector 8. Let the latter be 


the unit vector q then 
ζι 
τες [Lim | q (67) 


suo OU 


that is, the derivative da/du is in the direction of q. Note again that 
this direction is not necessarily in the direction of a. 

Consider the particular case when a is the position vector of a 
moving particle P. Replace a by r and μ by the time ¢, then in Fig. 9.30 


OP =r =r() 


and or is the displacement PQ of two points on the path of the particle. 
The limit of PQ/dt is the speed of the particle and the limiting direction 
specified by the unit vector q is that of the tangent to the space curve 
at P. But speed with an associated direction is simply the velocity; this 
is written 


τι Ξ ᾿Ξ ee (68) 
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Similarly, the derivative of the velocity may be evaluated to give the 
acceleration f, i.e. 


. V(t + δὲ) — We) 
=n “9.9 00 
anes ot 


Using Fig. 9.29 with a replaced by v it is evident that the acceleration 
vector is not necessarily in the direction of the velocity vector. In 
general the vectors of position, velocity, and acceleration are in three 
different directions. 

The derivative of a vector has been defined in the same way as for 
a scalar function and the following results may be obtained 


(69) 


fatn=F+2 (70) 

£ (fa) = a, ἜΤ (1) 
foneBaved (72) 
ἀολοτ (Ende) +bag) ὦ 
faxb)= δ xptaxe (74) 
Zax bx ja Gxbxotax (Fxe)+ax (bx δ) 
(75) 


It is important that the order of the vectors be unchanged whenever a 
vector product is involved. 


Example (i) A particle moves so that its position at time ¢ is given by 
r = 2 sin ti + 3 cos 1] + 7k where i, j, and k are constant base vectors. 
Show that 

F+r= (5- 2)k. 


Solution If r = 2sin ti+ 3 cos ἢ] + tk 

then t = 2 cos ti — 3 sin tj + 2tk 

and ἢ = —2sin ti— 3 cos tj + 2k 
= —(r — ¢?k) + 2k. 

Hence F+r= (¢? + 2)k. 


This result is a vector differential equation and represents three scalar 
equations of the type 


¥+x=0, jt+y=0, and Z+4+ z= (t? + 2). 
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Example (ii) What can be deduced about the acceleration of a particle 
that moves such that r x fr is constant? 
Solution Let ¢ be a constant vector such that 
rxr=c 
then, differentiating and using (74), 
rxr+rxF=0. 
Since i X i is zero, by definition, the acceleration is such that 
rxf=0. 
Either the acceleration is zero for all time or the acceleration vector is 
parallel to the position vector, that is the acceleration is directed to- 


wards (or away from) the origin. The particle is therefore moving under 
the action of a central force. 


If, in equation (72), a = b then 


d da 
2-4) ΞΞ 24. 
but 8. ἃ = qa? and it follows that 


a. =a 
du 


Hence if a is a vector of constant magnitude, i.e. |al| = a = a constant, 
then da/du = 0 and equation (76) shows that 


a.da/du = 0 (77) 


from which can be deduced the fact that a and da/du are perpendicular. 
Note that this is all that can be deduced from (77), as in general |da/du| 
is not zero. In particular, da/du = 0 but |da/du| ~ 0. 


(76) 


9.10.2 Differentiation of a unit vector in two-dimensions 


Consider a unit vector p at the origin of coordinates O at time ¢. 
Suppose p makes an angle θ with some given direction and that in a 
small time interval δ the vector takes up a new position at angle 
6 + 66 (Fig. 9.31). 
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Because |p| = |p + dp] = 1 = constant, then dp/dt is at right angles to 
p in the direction of the unit vector q. The magnitude of dp/dt is 


dp . ΡῸ 
ΓΝ ron δὲ 
Lim 2 sin (60/2) 
= 1 ---- -- ----.-᾿᾿-..-,.--ἘἘ 
δί-σο δι 
Lim sin (66/2) 66 
ee] -- 
sre (0)2) OF 
_# 
~ dt 
dp, 
and so τι θα (78) 


Any vector a may be written in the form ap where p is a unit vector 
in the direction of a. Differentiating and applying (71) and (78) leads to 


da ὦ 
τὶ Ξ 7A 
_& dp 
ΞΡ oa 
da 
=F P+ aq (79) 
The equation (79) shows immediately that 
da |? da\? a 
ahs (3) + a6 (80) 


and that |da/dt| and (da/dt) are different quantities, unless 6 = 0, i.e. 
unless a is in a constant direction. 


9.10.3 The velocity and acceleration components of a point moving in 
two-dimensions 


Let OP = r be the position vector of a moving point at time f. In many 
applications it is necessary to express the velocity and acceleration in 
Cartesian or polar coordinates. 


In Cartesian coordinates 


r=xi+ yj 
vo t= xit+ yj (81) 
and fav=Ff= χἱ Ὁ jj 


In polar coordinates, write 


r=rp 
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where p is a unit vector in the direction of r then from equation (79) 
᾿ dr : 
v=r= (]) p+ con 


Ξε ἡ Ὁ γθᾳ. (82) 


The result (82) states that the velocity v may be resolved into two 
components at right angles, 7 in the radial direction and ré in the 
transverse or cross-radial direction. These components together with 
the Cartesian components are illustrated in Fig. 9.32(a). 


¢ . 
+ wt (r28)q 


Fig. 9.32 
Differentiating equation (82) again 
d " d 
τον τ το rp tre + rq + OF + ry 


= Fp + γύᾳ + γθᾳ + ἐδᾳ + r6? (—p) 
on noting that dq/dt has a magnitude of 6 and a direction at right angles 
to p in the sense of increasing 9, i.e. in the direction of —p. 
Finally, ; ᾿ 
f = (F — ré)p + (276 + rO)q. (83) 


The two components of the acceleration are then ἢ — r6? in the radial 
sad ; . ld : ἐπε ρος 
direction and 276 + γθ or er (r?6) in the transverse direction. These 


acceleration components are illustrated in Fig. 9.32(b). 

If the point P moves in a circle of radius a so that |r| = r = a the 
polar acceleration components are given by (—a6?) in the radial 
direction and a@ in the cross-radial direction. If further, the angular 
speed ὦ is constant then the cross-radial acceleration component 
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vanishes and the radial component of (—a6) remains. Thus a point 
moving round a circle with a constant angular speed experiences an 
acceleration directed towards the centre of the circle, the so-called 
centripetal acceleration. 


9.11 EIGENVALUES AND EIGENVECTORS 


In 9.3.7 it was observed that the relationship y = Cx could be con- 
sidered as a transformation process, the vector x being transformed into 
the vector y by the matrix C. The two vectors are of the same order 
but are, in general, of different magnitudes and directions. The matrix 
C is necessarily square. 

Suppose that the matrix C is given. Then of all the possible trans- 
formations there exists a vector x which will be transformed into a 
scalar multiple of itself, that is, such that 


Cx = Ax (84) 
where A is a scalar. 

A particular transformation such as (84) is of interest in the mathema- 
tical sense but it is also of interest to the engineer when it is found that 
equations of this form play an important role in engineering problems 
(see section 17.3.3 of Chapter 17). 

The equation (84) may be written 


(C —ADx =0 (85) 


where 7 is the unit matrix of the same order as C. 
Suppose that x is represented as the column matrix (x,, x2, Xs)? and 
C = (¢;;), with i, 7 = 1, 2, 3 then (85) is equivalent to the set of equations 


(Cy. — A)xy + CraXq + CigX3 = 0 
CaiXy + (Cog — χα + Co3X3 = 0 (86) 
C3iX1 + CaaXq + (C33 — A)xg = 0 


The system (86) has a non-trivial solution if the determinant of the 
coefficients is zero [see the end of section 8.7.2]. 


Le. Ια —al| =0 (87) 
or 
Cy — A C12 Cig 
Cor Co2 — A Co3 | = 0 (87a) 
C31 C32 C33 — A 


The equation (87) leads to a cubic equation in Δ. This means that, in 
general, there are three different values of 2 and hence three different 
vectors x for which (86) has a non-trivial solution, that is for which x 
is transformed into a scalar multiple of itself by the matrix C. The alert 
reader will observe that two of the roots of the cubic equation may be 
complex and will also note that it is possible that there are equal roots. 
It is assumed here that all the roots are real. 
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The values of 4 obtained by solving (87) are called the eigenvalues of 
the matrix C (sometimes called characteristic values or latent roots), 
and the associated vectors x are called the eigenvectors (sometimes 
characteristic functions). The equation (87) is called the characteristic 
equation. 


Example (i) Find the eigenvalues and associated eigenvectors if 


2 - 0 
C= ]-1 5 —1 
0 -—3 2 


Solution The eigenvalues are the roots of the equation 


2-- —1 0 
—1 5-4 —1)=0 
0 —-3 2-- 


that is, of 
(2 — ALS — 1) — a) — 3. + 1[-2 + A] = 0 


The equation reduces to (2 — 4) (A — 1) (A — 6) = 0 and the three 
eigenvalues are A = 1, 2, and 6. The associated eigenvectors are the 
vectors (x1, X2, X3)™ given by solving the equations 
(2 ως A)x, — Xo =0 
—~x, + (5 — χα — Xz =0 
for the appropriate eigenvalue. 


If A = 1, then x, = x2 and 3x, = x3 and, choosing the value of x, 
arbitrarily leads to the eigenvector 


ι 


If A = 2, then 
Χο = 0 and —x, + 3x, - χε = 0 
which leads to x, = O and x; = —xg. The associated eigenvector is then 
B 
0 
ee δε τ 
where f is arbitrary. 
If A = 6 then 
—4x, — Xe ΞΞ 
-χ — XxX, —x,=0 


—3x, —4x3 = 0 
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which leads to 4x, = ~—x,, and 3X2 = —4x,. The associated eigenvector 
(letting x, = 7) is 
7 
3y 
where y is arbitrary. 
It is common practice to remove the arbitrariness of an eigenvector 


by making it of unit magnitude. Thus in the example above the three 
vectors would become, respectively 


Π{ΜΊ1 1{ν2 1/4/26 
(i), ( 0 ] and (-</) 
3/V/11 —1//2 3/4/26 


The eigenvectors are then said to be normalised. 
In a geometrical sense the above example shows that the trans- 


formation matrix 
2 -—1 0 
C= (- "πὶ 
1 —3 2 


Vil), ae 
will transform the vector χα) = ( [V1 ἢ into the vector y given 


ayy ~ a 
Wf[f/2\ 
or the vector x) = ( i Ἐ oe se Ae oe 
yy = Ae 
—1//2 
1//26\ , 
or the vector x = (- 4/ vas epee ya ia given 
3/4/26 " 


Example (ii) Find the eigenvalues of the matrix 


--Ἰ 2 0 
C= Ι 0 1 
1 - 0 


and the associated eigenvectors x such that Cx = Ax. 
Solution The characteristic equation is 


-1-- 2 0 
I --Ὺλ 1/=0 
1 -t -λ 


which reduces to (A + 1)%(A — 1) = 0. There are three values of A but 
only two are different, 1.6. A = 1 and4 = ~1 (twice). 
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If A = 1, the eigenvector is given by solving 


—2x, +2Xx2 => 0 
ΧΙ;  —-X_g Ἔχετξεῦ 
xX, —Xq τ τὰ > 0 


which leads to x3 = 0 and x; = x2. The normalised eigenvector is 


1/+/2 
("39 
0 
If A = —1, the equations for x reduce to 
2X2 =0 
xX, Ἔσχε +x, =0 
xy —Xq +X, =0 
which give x, =0 and x, = —X3. There is only one eigenvector 


associated with the double root and this is 


1]/4/2 
0 
—1/4/2 
in normalised form. 


In many practical problems the matrix C is symmetric (i.e. 
C1; = + cy) and it may be shown (but not here) that in this case there 
is a distinct eigenvector for each eigenvalue whether repeated or not. 


Example (iii) Find the eigenvalues and the associated eigenvectors for 
the symmetric matrix 


a. 2 4 
Cat 2 2 -2 
4... .: 


Solution The equation for the eigenvalues is given by 


—1—A 2 4 
2.2--λ —2/=0 
4 —2 --Ι-- 
which reduces to the cubic equation (A — 3)?(A + 6) = 0. 
If A = —6, the eigenvector is given by 


5X, +2x,. +4x,=0 
2x, +8x, ~—2x,;=0 
4x, --2χι Ἔ5Χ8 = 0 


Simple Gaussian elimination (see 8.6.1) leads to ~x, = 2X2 = X3, and 


the eigenvector is 7 
—2 
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2/3 
(2) 
—2/3 
in the normalised form. 


If 4 = 3, the eigenvector is given by solving 
—4x, 42x, +4x,=0 
2x, — χς —2x3=0 
4x, -—2x, —4x,=0 


where « is arbitrary, or 


The three equations have a redundancy and are equivalent to the single 
equation 2x, — x2 — 2x3 = 0, hence two of the variables, say x, and 
Xs, can be assigned arbitrary values β and y, and the vector takes the 


fornr 
B 1 0 
β' - »| or β β) Ἔγ [-: 
γ 0 I 


In this form the eigenvector is a linear combination of the two eigen- 


vectors 
1 0 
ἢ and [- 
0 ] 


It is an additional property of a symmetric matrix that the eigen- 
vectors are orthogonal, i.e. the scalar products of the vectors are zero. 
In Example (iii) the vectors (1, 2, 0)? and (0, —2, 1)” are orthogonal to 
the eigenvectors (2, —1, —2)? since 


2 2 
(2 [-}-ὸ and (0 -- 2 1){-1] =0 
2 2 


It follows that β and γ may be chosen to give any pair of eigenvectors 
which are orthogonal to each other and to the third eigenvector. For 
example, suppose the vector with β = 1, y = 0, ie. (1, 2, 0)? is called 
a and the vector associated with A = 6 is called b then the third vector 
perpendicular to the plane of these two may be called ¢ and the three 
vectors are mutually orthogonal if ¢ = a x b. 

Thus ς may be written 


ij k 
e=axb=/1 2 0 
2. sath 2 
= —4i + 2j — 5k 


where i, j, and k are the usual orthogonal base vectors in the Cartesian 
frame Ox1%2X3. 
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The vector ς then has the form 


and it may be verified that 


are mutually orthogonal. 


9.12 THE GRADIENT OF A SCALAR FIELD 


A scalar function of position (and possibly time) V(x, y, z, t) together 
with the region over which the function is defined is called a scalar 
field. Examples of such fields are the temperature T in a heated region 
or the electrical potential in an electrical conducting medium. 


9.12.1 Two-dimensional scalar field 


A two-dimensional scalar field may be conveniently represented geo- 
metrically by drawing /evel curves. 


V(x, y, 1) = c = constant 


For example, the Fig. 9.33 shows a possible set of equi-temperature 
curves across a rectangular region heated on two sides and held at 


T=0 
Fig. 9.33 


zero temperature on the other two. If the temperatures 7, and 7) 
are variable in time then the picture of level curves will be different for 
different times. The scalar, i.e. the temperature, is then called unsteady. 
However if the scalar function is independent of time there is only one 
level curve picture and the field is called steady. If the rectangular 
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region of Fig. 9.33 was initially at zero temperature and the two sides 
were suddenly raised to, and maintained at, temperatures 7, and Τ᾽, 
(constant with respect to time), then the temperature distribution over 
the region would be time dependent long enough for the heat to flow 
over the entire region. After a sufficiently long period of time the dis- 
tribution would become steady and the level curve picture would be 
the same for all subsequent time. 

Suppose that V(x, y) is a steady two-dimensional scalar field and P 
is any point on the level curve (see Fig. 9.34) V(x, y) = k = constant. 
Let N be an arbitrary point (x + dx, y + dy) on an adjacent level curve 
corresponding to V + dV and let PN be at angle θ to the axis of x. 
If's is a spatial distance measured in the direction PN such that PN = 
ὃς then the rate at which V is changing in the direction of PN is given by 
Limit (6V/ds) as 6s 0, 1.6. by dV/ds. This is called the directional 


x 


Fig. 9.34 


derivative of V (in the direction PN). Using equation (26) of 5.6.2 and 
noting that V is a function of x and y, as well as a function of 5. 
dV _ Vdx ov dy 
ds ~ 8x ds * ὃν as 
oV ov. 
Figg eee ogne (88) 


=acos@+ bsin@ 
= V(a? + 6?) sin (6 + «) (89) 
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in which, tan « = a/b, and a = 0V/éx, ὃ = δ [ὃν are the rates of 
change of V in the directions of x and y respectively and are independent 
of 6. 

From (89), it follows that the numerically least value of dV/ds is zero 
and that it occurs when θ = 0,,;, such that 


Onin = τα (90) 
: OV [dx 
ie. tan Onin = —a/b = -- av [oy (91) 
But if V(x, y) = k = constant, it follows (see section 5.6.2) that 
dy aV jex 
dx aV/ay 92 


and comparison of (91) and (92) shows that 6,,in is the angle of slope 
of the tangent to the level curve. Hence the minimum value of the 
directional derivative is zero and occurs when measured in the direction 
tangential to the level curve at P. The maximum value of dV/ds is 
4/(a? + 63), i.e. 

V/({aV ax}? + {eV /ey}*) (93) 


and occurs when 0 = Oma, such that 
Onax +ao= ἐπ (04) 


Thus the directional derivative is a maximum when taken in the direc- 
tion perpendicular to the level curve, i.e. in the direction of the normal 
to the curve and in the sense of increasing V. 

This maximum value of directional derivative together with its 
direction perpendicular to the level curve form a vector quantity which 
is called the gradient of V and written grad V or in symbolic form VV. 
Thus grad V is a vector of magnitude +/{(@V /éx)? + (V /éy)?} and of 
direction specified by 9max, i.e. by the slope 


tan Onax = cot « 
= bla 


_ oviey 
~ OV [ex 


The vector grad V thus has components in the x and y directions given 
by aV/éx and aV/éy respectively. It follows immediately that 


av. 
ey 


where i and j are the usual unit vectors in the x and y directions respec- 


ov 
grad V=We_=s7it (95) 
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tively. In the alternative notation 


eV OV 
grad V=VV = Be, ἢ + παν: 
ΟΝ 
Ξ 5, ὁ (96) 


using the summation convention on the symbol i. 


9.12.2 Three-dimensional scalar field 


The definition of grad V in (95) and (96) may be extended immediately 
to a three-dimensional field, i.e. 


av. av. av 
ue SNE Sh a Pe a, (97) 
or alternatively 
ove 
grad PG Gt 3. (98) 


In this case the geometrical representation of the field requires a three- 
dimensional picture of level surfaces V(x, y, z) = constant. The vector 
grad V is then of magnitude 


Vi(@V ox)? + (BV /ey)? + (@V/éz)?} 


at the point P and in the direction of the normal to the surface (ice. 
perpendicular to the tangent plane at P). 

Note The symbol V is pronounced de/ (sometimes nabla) and is a 
vector operator, i.e. an operator which when applied to V gives the 
vector grad V. It is often written 


ve ὃ, ὃ. Os 
“ax taylt a 


Example (i) Find grad V if V = x°y + 3xy2z — 2x2z? 
Solution Here 
ov ov 


dV a j k 
gra “te Falta 
= (3x?yp + 3y?2z — 4xz7)i + (x3? + 6xyz)j + (3xy? — 4x?z)k 


Example (ii) Find the equation of the normal to the ellipsoid 
2x? + 3y? + z? = 21 at the point (3, —1, 0). 


Solution Let V(x, y, 2) = 2x? + 3y? + z?, for all x, y, Ζ, be a scalar 
field, then grad V is perpendicular to the level surface V(x, y, 2) = 21, 
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i.e. is in the direction of the normal to the surface of the ellipsoid. Now, 
grad V = 4xi+ 6yj+ 2zk 
= 12i—6j+0k 
at the point (3, —1, 0). The equation of the required normal is 
Hla Ny SE 


2 πὸ Ὁ 
The normal is the line 2y + x = 1, z = 0, ie. the line 2y + x = 1 in 
the (x, y) plane. 


Example (iii) A particle at the point P (x, y, z) is subjected to a force 
F whose magnitude is inversely proportional to the square of the dis- 
tance OP and whose direction is towards the origin 0 (the inverse square 
law). Verify that there exists a scalar potential function V which is 
inversely proportional to OP and such that F = grad V. 


Solution Let OP = r 


then V =k/r 
where k is a constant and r = +/(x? + y? + 2?). 
Then grad V = kV(1/r) 


= k(—1/r*)V(r), 


using the chain rule of differentiation 


ὃ (ἡ 4 ξ- τ) 
ax \r)  ἀλν) ox ἃ r?) ax 


and two similar results for differentiation with respect y and z. 
Now v2 = χε y? + 2? 
and so 2rér/ox = 2x, 
ie. Or/Ox = x/r, with similar results ér/éy = y/r, and ér/éz = z/r. 


Finally, 
F = grad V = k(—1/r?) (=i ἘΠῚ + =x) 


= —(k/r*)r. 


This last result shows that F is proportional to (1/r?) and that F is in 
the direction of —r, i.e. towards O. 


9.12.3 The directional derivative 


Suppose a is a given vector, then the directional derivative of V in the 
direction of a is dV/ds where s is measured in the direction of a. 
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where δὴ is measured in the direction of the normal to the level curve 
or surface (see Fig. 9.35). 


Fig. 9.35 


Thus, 
dV OV. on 


a = Lim an Lim bs 
= cos 0 
ΒΕ. τς 
~ dn |al 
_ @V/dn)n.a 
fal 
where n is the unit normal and 6 is the angle between n and a. Finally, 


dv VV.a_ grad V.a 
ΕΝ a 
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Example Find the directional derivative of V = 3xyz — x°y + 2xz® 
at the point (1, 1, 1) in the direction specified by the join of (2, 1, 0) to 
(—1, 3, 4). 


Solution The required direction is a and 
a= —3i + 2] + 4k. 
The directional derivative in the direction of a is 


[yz — 2xy + 22z)i + 3xz — χ Ἢ + Gxy + 4xz)k) . [--31 + 2j + 4k] 
V(9 + 4+ 16) 
which at the point (1, 1, 1) becomes 23/4/29. 


EXERCISE 9 


1 State which of the following are scalar quantities and which are 
vector quantities: speed, weight, pressure, time, voltage, tension, 
momentum, temperature. 


2 Two aircraft start from the same point on the Equator. The first 
travels 500 kilometre due East and then 500 kilometre due 
North while the second travels 500 kilometre due North and 
then 500 kilometre due East. Do they arrive at the same point? Are 
these directed magnitudes vectors? 


3 Free vectors a and b are inclined at an angle 9, show that 
[4] — |b] < ja + Ὁ] = (a? + b? + 2ab cos 6) < [4] + [5]. 


4 The position vectors of the four points A, B, C, and D are respec- 
tively, a, b, 3a + 2b, and 2a — b. Express AC, DB, CB, and CA in 
terms of a and b. 


5 In any triangle ABC prove by vector methods that (i) the straight 
line joining the middle points of AB, AC is parallel to BC and half 
the length of BC, and (ii) if D, E, F are the mid-points of BC, CA, 
and AB respectively and O is any other point, 


OD + OE + OF = OA + OB + OC. 


6 Two forces act at the corner A of a quadrilateral ABCD and two 
at the corner C. The forces are represented in magnitude and 
direction by AB, AD, CB, and CD respectively. Show that their 
resultant is represented by 4PQ where P and Q are the mid-points 
of AC and BD respectively. 


7 If a and b are the position vectors of two points A and B find the 
position vector of a point C on AB produced such that AC = 3AB. 


8 Vectors from O to A, B, C, and D are represented by a, b, ς, and ἃ 
respectively. The joins CB and AD intersect at P. If AP = $AD 
and CP = 1CB show that d — b = 2(ς — a). 
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9 A, B, C, and G are the points (2, 1, 3), 3, --1, —2), (—2, 3, 5), and 
(1, 1, 2) respectively and P is any point. Show that PA + PB + PG 
= 3PG and state what can be deduced about the point G. 


10 Find the magnitude and direction of the resultant of the vectors 
(Ὁ 71 + 2j — 5k, 3i — j — 2k, —2i — 2| + k; (ἢ of magnitudes 
6, 5, and 3 in the directions [2:1 :2], [4:3:0], and [2:2:—1] respec- 
tively. 


11 Find the centre of gravity of masses 3, 4, 5 at the points (2, —3, 3), 
(5, ~3, —4), and (2, ~3, —1) respectively. 


12 Forces of magnitudes 1, 2, and 3 act at one corner of a cube along 
the diagonals of the faces meeting at that corner, find the magnitude 
of their resultant and its inclination to the edges of the cube. 


13 The position vector of a point A is 2i — 3j + k. Find the vector 
equation of the line through A in the direction of the vector 7] + k. 
Find the coordinates of the point Bon the line such that AB = 6/2. 


14 Find the equation of the line passing through the points (1, I, 1) 
and (—2, —1, 3). 


15 A and B are the points whose position vectors are 3i + 4j — k and 
—3i + 7j + 2k respectively. Find the position vector of the point 
that divides the join of AB (i) internally in the ratio 3:4, and (ii) 
externally in the ratio 4:7. 


16 A force of 13 units acts at the point (1, 3, —4) and its line of action 
is parallel to the join of the point (1, 0, 3) to the point (5, 1, 7). Find 
the vector form of the force. 


17 Find the scalar projection of the vector 2i — 3] + 7k on to the 
vector 61 + 4j — k. 


18 Find the vector projection of 2i + 3j on to the vector 4j + 5k. 


19 Show that the perpendicular distance from the origin to the line 
r=a-+ tb is given by |a — (a. b)b/b?|. Find the perpendicular 
distance from the origin to the line 3j + 4k + 1(51 + 5j). 


20 Show that the perpendicular distance from the point whose position 
vector is ¢ to the line r= a+ tb is |(c — a) x bj/b. Find the 
perpendicular distance from the point P(—1, 3, 4) to the join of 
A(2, 3, 5) and B(—6, 7, 9). 


21 A particle moves in a straight line from a point A(O, 0, 0) to a point 
B(0, 1, 2) and then in a second straight line to a point C(I, 0, 3) and 
finally in a third straight line back to A. It is acted on by a force 
2i + 4j + 3k while moving along AB, by a force 2i + 3k while 
moving along BC and by a force —3k while moving along CA. 
Find the total work done by these forces. 
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22 Prove that the vectors a=2i+j+2k, b=i+ 3] — 7k, 
ς = 3i + 4j — 5k form a right angled triangle. 


23 Find the area of the parallelogram whose sides are the vectors 
i+ 3] -- Κ, 2i—j+ 2k and determine a unit vector normal to 
its plane. Find also the equation of the plane. 


24 What is the unit vector perpendicular to each of the vectors 
2i — j +k and 3i + 4j — k? Find two unit vectors in the plane 
of these vectors, one perpendicular to each of the given vectors. 


25 Find the moment about a point i + 2j — 2k of a force represented 
by 3i+ 4k acting through the point whose position vector is 
i—j + 3k. 

26 The vector ς = /i+ mj+ nk is perpendicular to the vectors 
a =i + 2k and b = —i + 2j + 3k. Show that /:m:n = 4:5: —2, 
(i) by solving the equation c.a = 0 = ς . b, and (ii) by evaluating 
a xb. 

27 Forces i — j + 3k, —3i + 3j — 2k, and 2i — 2] —k act at the 
points (1, 1, 1), (2, 3, 0), and (—3, 4, 1) respectively. Show that they 
are equivalent to a couple of moment +/138 about an axis whose 
direction ratios are [—4:1:11]. 


28 If ax b=n and n.c =a where a = a,i + ask, b = b,i + δ], 
and ¢ = cj + csk show that « = 4,820 + agbiCo. 

29 Show that the line through the point (0, 4, 4) in the direction of the 
vector [—1, 2, 1] intersects the line through the point (1, 4, —1) in 
the direction of the vector [0, 1, —2] and find the point of inter- 
section. ᾿ 

30 Find the shortest distance between the lines 

(x + 3)/2 = (y — 6/3 = ( — 3)/(—2) 
and x/2 = ἡ — 6)/2 = z/(—1). 

31 Find the triple scalar product of the three vectors a = 2i —j + k, 

=i—k, ande = 31 Ὁ 2j. 


32 Find values of β if 21+ Aj +k, fit+j+k, and fi + 3] — 3k, 
are coplanar vectors. 


33 Find the coordinates of the point of intersection of the line 
r=a-+¢b and the plane r.n=8 when a= 8i+ 7] + 12k, 
b = 3i + 4] + 6k, and n = 4i + 3k. 
34 Show that the system of equations 
3x -- 2ν Ὁ z=1 
2x + 4y --22- 3 
Tx + 6y —3z=7 
is consistent but indeterminate and illustrate the result geometrically. 
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35 Show that the system of equations 
2x + 3y -- 72 - 2 
x— γ Ξε 4 
3x + 2y -- 12 - 1 
is inconsistent and illustrate geometrically. 
36 Prove that (a x b). (a x c) = (Ὁ. c)a? — (a. bla. ο). 
37 ABC is a triangle with a right angle at C. If CB = a, CA = , and 


AB τε use the scalar product of vectors to prove Pythagoras’ 
theorem. 


38 P,, Pz, and P3 are the points whose position vectors are rj, r2, and 
Ts respectively. Show that the equation of the plane through the 
points is given by 

( -- τι). (τ — τι) X (τ — τ) =0 
and express the result in the form of a determinant. Find the 
equation of the plane through the points (2, 1, 4), (2, 0, 1), and 
(- Ι, 3, 0). 
39 Given r = cos ti ~ sin 2tj + εἶ find dr/dt, dr/dt? and |d?r/dt?|. 
40 The position vector of a point at time ¢ is 
r= 2ti+ 3 fj — 2k. 
Find the velocity and acceleration and show that 
4v? = 16 + f* 
where v and fare the magnitudes of the velocity and acceleration. 

41 Ifr = asin wt + bcos wt where a and b are constant vectors, show 
that d*r/dt? = —w?r. 

42 Show that r = 2icos ¢ + 2] sin ¢ lies on a circle for all t. Show that 
the equation of the tangent at the point ἢ, is 

r = 21 008 ἡ; + 2jsin t, + «(2jcos ἡ; — 2isin 1) 
where « is a parameter. 

43 By differentiating ἃ. ἃ show that a.(da/dt) = a(da/dt). If a is of 
constant length deduce that da/dt and a are at right angles. 

44 A particle moves on a circle r = a with a variable angular velocity 
w. Find the components of acceleration. 


45 Verify the expressions 7 and r6 for the velocity components in polar 
coordinates by differentiating and resolving the relationships 
x =rcos 6, y = rsin 6. Also verify the acceleration components 


id_,, 
yo 2 — —(p2 
F — τὸ, τα θ). 
46 If ἃ -Ξ 211 -- 177+ 2Κ, ὃ τὸ 31: -Ὁ ἢ τ- 2ΙΚ, and c=?k find 


d(a.b)/dt, d(a x b)/dt, and d(a.b x ©)/dt, (i) by differentiating 
the products, and (ii) by first evaluating the products. 
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47 A body moves on the curve r = ae®° where a and « are constants 
(the equiangular spiral) in such a way that d6/dt is constant and 
equal to w. Find the polar components of the velocity and show 
that the resultant velocity makes a constant angle with the radius 
vector. Find also the polar components of the acceleration. 


48 A particle is moving such that its acceleration at time 1 is 2i + rk. 
If it initially passes through the point (1, 2,3) with a velocity 
2j — k, find the parametric equations of its path. 


49 Obtain the eigenvalues and normalised eigenvectors of the matrices 
11 
(i) (: a (i) (2 2 Ἶ 
3.3.2 
50 Given that the matrix 
11 πό 2 
πό 10 - 
2-4 6 


has an eigenvalue 18 and corresponding eigenvector 


2/3 
(ἢ 
1/3/, 


find the other eigenvalues and eigenvectors. 


51 Find the eigenvalues and three mutually orthogonal eigenvectors 
of the symmetric matrix 


011 
101 
i 1 Of. 


52 Find the angle between the normals to the surfaces x? + y? 4 z? = 14 
and z= x? y? — 2 at the point (1, — 2, 3). 


53 P is a fixed point 2i — j + Καὶ and Ο 15 the general point xi + yj + zk. 
The function ¢(x,y,z) is the square of the distance PQ. 
Evaluate grad ¢, and find the directional derivative of ¢ in the 
direction i + 2] + k when Ὁ has position vector 3i + j — 2k. 


54 A particle is constrained to move along the helix 
r(s) = cos ὡδὶ Ὁ sin wsj + ask 


where w? + «? = 1. It is acted upon by a force field F = Vd where 
¢ = logr and r =|r|. Show that the work done by the force on the 
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particle during an incremental displacement dr is 
as 
F.dr= 1+ a2s2 ds 


and hence when the particle moves from s = 0 to s =2z/w the total 
work done is log (1 + 47?«?/w?). 


10 
Complex Numbers and Complex 
Variables 


10.6 INTRODUCTION 
From a historical point of view the theory of complex numbers arose 
from the study of the solution of algebraic equations particularly of the 
quadratic equation ax? + bx + ὁ = 0. The equation x? — 4x —5 =0 
or (x — 2)? = 9 has two unequal roots given by x — 2 = +3; the 
equation x? — 6x + 9 = 0 or (x — 3)? = 0 has two equal roots x = 3 
but the equation x* — 4x + 13=0 or (x — 2)? = —9 has roots 
which involve the finding of a number whose square is —9. Such a 
number can be represented by the introduction of a symbol i (some- 
times j is used) such that i? = —1 or i = »/—1 and the two roots of 
χῇ — 4x + 13 = 0 can then be formally written x — 2 = +i3. The 
invented number i3 was called an imaginary number, the other numbers 
such as 3, —3, or +/3 with which familiarity was already established 
being called real numbers. The numbers of the form 2 + i3 were called 
complex numbers. The main reason for the introduction of such numbers 
was for uniformity so that it could be said that any quadratic equation 
always had two roots, real and unequal, real and equal, or complex. 
The term imaginary is an unfortunate legacy from the past and 
implies that the numbers have some mystical character. From a mathe- 
matical point of view the imaginary numbers or the more general 
complex numbers are considered well-behaved providing they obey a 
consistent set of algebraic rules, while from an engineering viewpoint 
the new numbers are of immediate interest and acceptable providing 
that they can be successfully used in an engineering context. For the 
latter it is more advantageous to ignore the historical aspect of complex 
numbers and to consider the symbol i as an operator. Note that i here 
has no connexion with the unit vector i (of i, j, k) of Chapter 9. 


10.1 SYMBOL i AS AN OPERATOR 
Consider any vector OP lying in a plane and of length a units, i.e. 
a = OP. The symbol (—1) may be thought of as an operation which 


when applied to OP has the effect of turning OP through two right 
angles in the anticlockwise direction, this being the mathematically 


positive direction, and leads to another vector OQ such that 
OO = (—1)0P = (—l)a = —a. (1) 
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A ‘mid-way’ stage in this operation would be to turn the vector OP 
through one right angle in the anticlockwise direction. Suppose that 


the symbol i is given to this operation then iOP will be represented, in 
Fig. 10.1, by the vector OR such that 


OR = iOP = ia. (2) 


Fig. 10.1 


A further application of the operator i to the vector OR will then 
turn it through a right angle to the position of OQ so that 


—a = O00 = i(iOP) = [ΞῸΡ = ia, (3) 


Equation (3) shows the effect of a double application of the operator i. 
That is i or i? turns the vector through two right angles and has the 
same effect as the operation —1 of equation (1). In an algebraic sense 
(3) may be interpreted as showing the equivalence of i? and —1 and 
in this sense i could be said to be +/—1. As mentioned in section 10.0, 
the statement i = «/—1 may be used as a starting point for the historical 
development of a complex number theory. The operational approach 
is simply to consider i as an operation that turns a vector in a plane 
through one right angle and i? (or —1) as the operator that turns the 
same vector through two right angles. 


A third application of the operator i will turn OO into OS, i.e. 
OS = i(OP) = [30} = —iOP (4) 
and B= i? τῷ ~i. (5) 


The operator i? applied to OP has the same effect as the operator (—i) 
applied to OP. The first turns OP through three right angles in an anti- 
clockwise direction, the second turns OP through one right angle in a 
clockwise direction and in either case the vector OS results. 
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10.1.1 The Argand diagram 

In Fig. 10.2 let X’OX produced be an axis on which all real numbers 
from —oo to +00 can be represented, O being the origin zero. For 
example the number +3 is represented by the point P on the line OX 
distant 3 units from O and the number —3 is represented by the point 
Q distant 3 units from O in the direction OX’. These numbers may be 
represented also by the vectors OP and OQ (or — OP) but by convention 
they are simply written +3 and —3 the direction of the vector being 
understood. Now draw an axis Y’O¥ to intersect X’OX at right angles 
at O then a number such as 12 may be represented by the point R distant 


ἰἐΓ------- --τ-τς- -- ΖΞξΞα τὰν 


ἐξοιξοςι σι ἃ 


Wbeuss 


Fig. 10.2 


2 units from O in the direction OY and —i2 by the point S distant 
2 units from O in the direction OY’. Again, the number i2 may be 
represented by the vector OR being the vector +2 turned through 
one right angle in the anticlockwise direction, 1.6. OR = i(+2) = i2. 

Consider the addition of the two numbers 3 and i2. The sum is 
formally written 3 + 12 and may be represented by the point T or by 
the vector OT, the sum of the vectors OP and OR (see section 9.3.3). 
Thus in Fig. 10.2 the complex number 3 + i2 is represented either by 
the point T or by the vector OT. 

The axis X’OX containing all the real numbers from —o to +00 
is usually called the axis of real numbers or the real axis and the axis 
Y’OY containing all the imaginary numbers from —ico to +ioo is 
called the axis of imaginary numbers or the imaginary axis. It is usual 

_to let any arbitrary real number be denoted by x and to let any arbitrary 
imaginary number be denoted by iy (where y is a real number). The 
sum of these numbers is then denoted by z, i.e. 


Z=x+ ip. 
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The two numbers x and y are called the real part and the imaginary 
part of z respectively and written Re z and Im z. 

The geometrical interpretation that any complex number z may be 
represented in the Cartesian plane by a point whose coordinates are 
(x, y) where the x-axis contains all the real numbers and the y-axis 
contains all the imaginary numbers is called the Argand diagram. The 
Cartesian plane is often called the z-plane. 


Note 1 There is rarely any confusion between z the complex number 
and z the spatial coordinate of (x, γ, 2) because the algebra of 
complex numbers is applied to problems essentially of a two-dimen- 
sional nature. 


Note 2 Although y is called the imaginary part of z it is a number as 
real as x. Some writers call iy the imaginary part of Ζ. 


Note 3 Too much emphasis should not be placed on the representation 
of a complex number as a vector because of the operator i in the 
representation. For instance if e, and e, are two orthogonal unit base 
vectors in a Cartesian coordinate system then by equation (33) of 
section 9.6.1, e,.e, = 1 and e,.e, = 1 but if n is a unit vector 
along OX in the Argand diagram so that in is the unit vector along 
OY then n.n = 1 whereas (in). (in) = —1, ie. a different product 
is obtained depending on the direction taken. 


Note 4 The real numbers are a particular case of the complex numbers. 
Note 5 Numerical complex numbers are frequently written in the form 
3 + 2i instead of 3 + i2. 


10.2 THE ALGEBRA OF COMPLEX NUMBERS 


The rules of common algebra may be applied to complex numbers 
providing that any product i? is replaced by —1. 


10.2.1 Equality of complex numbers 
Two complex numbers are equal 


X, + iy, = χα + ive 


if and only if x, = x, and y, = Ya, as is immediately obvious by 
plotting the two points on the Argand diagram. The zero complex 
number is 0 + i0 represented by the point O in the Argand diagram. 
A complex number is zero if and only if its real and its imaginary parts 
are zero. 


10.2.2 Addition and subtraction of complex numbers 

The sum of the complex numbers z, and zz is given by 
G+ iy,) + (x2 + iva) = ας + X2) + iG + Yo) 

and is identical with vector addition. 
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The difference of two complex numbers z, and zz is the sum of Ζι 
and —2Zg, 1.6. 


Ζ, — ΖΩ = Ζ: + (—22) 
= (x1 + iyi) — (χα + ipo) 
= (x, — X2) + i — »). 


Fig. 10.3 


In Fig. 10.3 the points P and Q represent z, and z, respectively, R is 
the sum z, + z, and S is the difference z, — Ζᾳ. It is sometimes an 


advantage to represent z; — zz in vector form by OS or its equivalent 
vector QP. (Compare with section 9.3.3.) 


10.2.3 Multiplication of complex numbers 


Given two complex numbers z, and z, the product is another complex 
number z, such that 


Z3 => 2129 
= (x, + iyi)&%2 + iyo) 
= (x1X%2 — Vive) + ἰ(χι)ὰ + X2)1). (6) 


The result (6) shows that multiplication is commutative, i.e. 
ΖιΖα = 2973. 
The product of two complex numbers is real providing that 
χα} ΞΞ —Xet γα. 


In particular the numbers x, + iy, and x, — iy, whose product is 
x? + y? are called conjugate complex numbers. If z = x + iy is any 
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general complex number the conjugate is denoted by Ζ, so that 
z= x — iy. In the Argand diagram Z is the reflection of z in the real 
axis. Note that 


Ζι -Ὲ Ζᾳ ΞΞ Ζ: :Ἐ Ζμ 
ΖιΖῳ = Ζ,Ζ,. 
Example (i) (3 + 2i)+ (5 —7i)=8 -- δὲ 
(3 + 2ὴ -- ὁ -- Τὴ = —24 93 
6(1 — 5i) = 6 — 30i 
3(2 — 3i) — (7 — 2) + (1 + 71) = (6 — 91) ~ (6 — 97) 


= 0 + i0. 

Example (ii) (4 + 213 — i) = (124+ 2) + [(6 -- 4 
= 14 - 2i. 

Example (iii) (1 + 2%) =i + 213 
= --2 -Ἐἰ, 


Note that if (1 + 2) in example (iii) is represented by the vector OP 


then i(1 + 21) is represented by the vector OQ obtained by turning OP 
through one right angle in the positive sense. 


10.2.4 Division of complex numbers 


The value of (x, + iy:)/(x2 + iy.) is defined as a third value ¥ + iY 
such that 


χὰ + iy = (ἡ + iva (X + iY) 
= (οἵ — γω Υ) + ie Y + Xyp). 
From the equality of complex numbers it follows that 
X_X — yoY = x1 


and ΧΩΥ͂ + Xye= γι 
andhence X¥+iY= Mika + Via aya = χιγῳ) ria 2), 


χὰ Ὁ} ἜΣ 
This result can be obtained by an alternative but equivalent process as 
follows. The numerator and denominator of the quotient 


(x1 + iys)/(x2 + νὼ) 


are each multiplied by the complex conjugate of the denominator so 
as to make an equivalent quotient with a real denominator. 
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αἱ + tye — tye) 
(x2 + iva)(%2 — tye) 
_ (Xe + YrYo) + rays -- χι)) (7) 
xa + Ya 
The result (7) need not be remembered but each division required 
should be evaluated using this alternative process. 


Thus X+iY= 


Note that (7) reduces to x,/x2 if y, = Ὁ = yo. 
3+ 2i) (3+ 2144+ 31) 
G3)” @= 34 + 3) 
(12 — 6) + i(9 + 8) 
~~ (16 + 9) 
= (6 + 17i)/25. 
ατὴῶ- ὃ 3+ 
4 --ἨνΜΝϑ ἢ 4-- 2ὶ 
(3 ὴ(4- 2ὴ 
16:4 
= (1 + i)/2. 
10.3 POLAR FORM OF A COMPLEX NUMBER 


Any complex number z = x + iy may be represented by the point P 
on the Argand diagram as in Fig. 10.4, i.e. by the point whose Cartesian 
coordinates are (x, y). 


Example (i) 


Example (ii) 


Fig. 10.4 


It is frequently an advantage to represent the point in terms of its polar 
coordinates (r, 0). Now, 


z=xt+iy=rcos6+ ir sin 0 
=r(cos@ + isin 6). (8) 
In abbreviated form this is often written r cis θ or r/[6, so that 
r(cos 6 + isin 6) =rcis 0 =r/0. 
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10.3.1 Modulus and argument 
The polar coordinate r is a positive quantity being the magnitude of 
the vector OP. It is called the modulus of z and is written mod z or 
|z\, ie. 
mod z = [2] =r = o/(x? 4+ y?). (9) 
The polar coordinate 9, giving the direction of the vector OP, is 


called the argument or amplitude of z and written arg z, amp z, or 
am z. The value of θ is found from 


tan 6 = y/x (10) 
the quadrant in which 6 lies being determined from the equations 
x=rcos#, y=rsin6, —180°< 6 < 180°. 


It is sometimes convenient simply to plot the complex point on the 
Argand diagram and hence to determine the quadrant in which 6 must 
lie. The value of 6 within the above range is often called the principal 
value of argz. Note that —1 = 1/180° and that if z=~r/6 then 


Ζ τεῷ γ[--θ.Ψ 


Example (i) Express 2 + 31 in the polar form. 


Solution r= VJ/(44+9)= V13 
and tan θ = 3/2 
6 = 56° 19’, 


In abbreviated form 2 + 3i = 1/13/56° 19’ = 1/13 cis (56° 19’). 
Example (ii) Express —2 — 3i in polar form. 


Solution Here, r= ν 13 
and tan 9 = —3/—2 
6 = —(180° — 56° 19’) = —123° 41’. 


10.3.2 Multiplication and division in polar form 
Let 
Z, = r,(cos 6, + isin θ}) 
and Z_ = r2(cos 62 + isin 6) 
then the product z,z, is given by 
Z1Zg = Mire (cos 0, + isin 6,)(cos 6. + isin 6.) 


= r,r.[(cos 6, cos 0. — sin 6, sin 62) 
+ i(sin 0, cos 62 + cos 0, sin 62)] 


= PyIg [cos (0, + 6.) + isin (6; + 9.)] (11) 


252 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 
or in abbreviated form 
(r:/9:)(r2/92) = riro/(O1 + 95) = rire cis (0, + 92) (12) 


Thus to multiply two complex numbers in polar form multiply the 
moduli and add the arguments. Similarly, 


Zz) r,(cos 6, + isin 0,) 
Ζ2 ἵαίοοβθ, + isin 02) 
hh (cos 6, + isin 8;)(cos 62 — isin 6.) 


re cos? 6, + sin? 0, 
__ 11 COS (0. — 6.) + isin (0; — 92) 
Te 1 


or, in abbreviated form, 


7/0, ry 
rie = 75 ΕΣ (13) 


Thus, to divide two complex numbers in polar form, divide the moduli 
and subtract the arguments. 

The argument of a product, or quotient, of two complex numbers 
should always be given within the range of values —180° to +180°. 
Note the following results 


[Ζ1Ζα] = [2|Ζα!; [2126] ΞΞ [Z1|/|Zal; ΖΖ.-Ξ |z|? = |2|? = γ3. 
Example (i) In polar form (—1+)(1 + i\/3) may be written 
(/2/135°)(2/60°) or 24/2/195°, Using the principal value for the 
argument this is 2+/2/—165°. 


Example (ii) Express (—i)/(2 + 2i)(—3 + +/3i) in polar form by two 
different methods. 


Solution 
--ἰ --ἰ 
@ σταρϑε V3 * (6-2/3) — [6- 2.3 
Ϊ 
~ VOV6 + Μ2) + V6 -- V2) 
_ (V6 = v2) +6 + v2) 


16/6 
_ V16/tan~* (6 + νὭί νό — V2) 
~ 16/6 


= i /tan-* (2 + 4/3) 
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; =F = 1/—7/2 
© GF INK3 F VN συν 3 ERB) 


= iw [--(π|2 + π|4 + 50/6) 
ἘΞ i [—197/12 


1 es 
= χε [35π|2 or a6 113:: 


10.3.3 Representation of multiplication and division on the Argand 
diagram 

Given a complex number z, represented by the point P or the vector 
OP in the Argand diagram, an examination of equation (12) shows that 
the effect of multiplying by zz is to rotate the vector OP through an 
angle arg z, in the positive (anticlockwise) direction and to change the 
magnitude of OP in the ratio ΤΡ Τὰν Le. 72:1 or [24 :1. Similarly from 
(13) the effect of dividing z, by z, is to rotate the vector OP through 
an angle arg z. in the negative (clockwise) direction and to change the 
magnitude of OP in the ratio 1:24]. 


Fig. 10.5 


In Fig. 10.5, P represents z,, Q represents z2, and 7 represents the 
point 1 + 10, Then R will represent the point z;z, and S will represent 
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the point z,/z, where R and S are such that the triangles OQ/, ORP, 
and OPS are similar so that 


OR OO 
OP = (OF OOR= lallzal, 
Os 1 

ΞΡ ὡς 00 or OSs = [23{{|2«]. 


and [ROP = [POS = [00] = arg Zz. 


10.3.4 Inequalities 
Complex numbers cannot be ordered in the same way as real numbers 
so that a statement such as Ζ; > ΖΩ or X,; + iy; > χα + iy, is meaning- 
less. However, numbers can be partially ordered by the size of their 
moduli, e.g. [3 + 2i| > |1 — i]. From Fig. 10.3, it is evident that 

|Z: + Za] < [Ζ}} + [Ζα] 

[21 — 22| < [22] + |Zal 

[21 — 2a| > [[2a] — [ol]. 


These results should be compared with those of example (ii) of section 
9.3.3. 


10.4 COMPLEX EXPONENTIAL QUANTITY 
In Chapters 4 and 5 it was found that for all real x 


expx=e?= Sx nlaltx4+x724... (14) 


n=0 


sin x = > (~—1)"x2"*2/QQn + 1)! = x — x9/3! + x8/SE4 0... 
n=0 ( 


15) 
and cosx = ¥ (—1)"x24/(2n)! = 1 -- 22/2! + xt/4l4... (16) 
n=0 
For complex z, a quantity exp z may be defined as in equation (14), 1.6. 
expz=1+z24 232! Ὁ 233} τι νον τοὺ Σ 24/nl, (17) 
n=0 


It is not immediately evident that exp z behaves like an exponential 
quantity and a certain amount of discussion of the properties of (17) 
is required. 


ao ao 
The series } |z|"/n! or > r*/n! is convergent for all r, i.e. for all |z|, 
n=0 n=0 


and hence the series δ᾽ z"/n! is absolutely convergent (see Chapter 3). 


n=0 
Note that if z is real, that is if z = x = r, then the series converges to 
65, 1.6. 
exp Ζ = expx =e’. 
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If z is replaced by iy then (17) gives 
exp (iy) = 1 + (Ὁ) + (2/2! + H)I/3!.. 
=(1-y/2!+y*/4l—..) - ἴ΄ὺ — y9/3! + y3/5!t-—. ἡ 
= cosy + isin y (18) 
using (15) and (16). 
To establish the result 
exp (2, + Z2) = exp (2) exp (z.) (19) 


requires a theorem concerning the multiplication of absolutely con- 

vergent series the proof of which cannot be given here. The theorem 
0 οο 

states that if S, = Σ᾽ ἃ, ἀπά S, = > b, are two absolutely convergent 
n=0 = 


3 n=0 
series, then 


fo) 
where c, = > ayby_ 1. 


t=0 
To form the result (19), let δ᾽ = exp z, and (ὃς = exp Zz 80 that 


a, = Z2/n! 
and b, = z2/n!, 
then c= Σ [et/tier yn — Ὁ}} 
t=0 


᾿ a ton-t 
= (1/n!) 2, Ἢ Ζ122 
= (Ζ; + 22)"/n! 


the last line following from the binomial theorem. Hence by the quoted 
theorem 


(5 an!) (5 zijn!) = > (4 + 22)"/n! 
0 0 0 
and the result (19) follows at once. 
Using (19) 
exp Z = exp (x + iy) 

= exp x exp (iy) 

= οὗ (cos y + isin y) (20) 
and expz is a complex number of modulus εἴ and argument y. 
The complex number exp z of (17), the sum of the series > z*/n!, has 


ο 
the properties of an exponential quantity and will be designated 65. 
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Thus (19) may be written 
e714 22 = e*1 e%2 (21) 
and (20) becomes 
e* = expz = ε΄ (cosy + isin y) 


= e* exp (iy) 
= εἴ εἰν (22) 
using (18) in the form 
οἷ" = cos y + isin y. (23) 


The equation (19) may be extended to read 
e21t2atZatiet2n = 921 652 6.28» E2n, (24) 
The polar form of the complex number may now be written 
z=x+iy =re® = r(cos 6 + isin 8). (25) 
The exponential form of the complex number is called Euler’s form and 
θ is now expressed in radian measure. 
Example i = e'7/?; —| = e'7; 1 = ec = e?!™ = οὔτ k an integer; 
V3 — i = 2e- 8; —] pi = νοῦ πα, 
2 Εἰ τῷ V/5e?, 6 = tan-*(4) = 0.46. 
Unless stated otherwise the argument @ is always expressed in 


principal value form, i.e. —7 < θ < +7. It should be noted, however, 
that given an argument 6, then 


οἰθ = efle+2ikm), kk an integer. (26) 


10.5 DE MOIVRE’S THEOREM 
This theorem states that for all n 
(cos 6 + isin 6)" = cos πθ + isin nd. (27) 
There are three cases to consider: 
(i) 1 a positive integer, 
(ii) π a negative integer, and 
(ili) π rational but not an integer. 
Case (i) 
From (25) of section 10.4, 
cos 6 + isin @ = οἱ 
and so (cos θ + isin 6)" = (e)* 
= (eee)... (e') 
= eine 
using (24) of section 10.4. Hence, 
(cos 6 + isin 6)" = cos nO + isin πθ. (28) 
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Case (ii) 
Let n = —m where m is a positive integer and write 
(cos 6 + isin 0)" = (e'%)" 
aia (ε΄) - πι 
ae 1 /(e)" 
— 1 /et™® 
using equation (24) of section 10.4. 
From equation (21) of section 10.4 with Ζὰ ΞΞ — Ζι 
1 = οῦ = e*1e7 71 
or e741 = 1 /(e*1), (29) 
Hence, (cos 6 + isin 6)" = 1/(e'™*) 
=e ἐπιθ᾽ 
= einé 
= cos n6 + isin nO. (30) 
Case (iii) 


Let n = p/q where p and q are integers without a common factor (other 
than unity), then 
(cos 6 + isin 6)" = (e!)" 
Ἐπὶ (e'9)P/9 
= (e'”)1/, since p is an integer 
= [(e'??'2)7}/¢, since q is an integer 


= (¢'79/2) 
= einé 
= (cos “0 + isin n6). (31) 
Thus, for all 
(e%)" = eine (32) 


which is Euler’s form of de Moivres theorem. Note that if n is a rational 
fraction p/q then e'?®/* is only one of the possible values of (ec)? 7, In 
general terms, from (26) of section 10.4 


efpela — (e+ 2tkr)p/a (33) 


Example (i) (1 + i)?7 = (4/2 e'7/4)27 
== 237/2 @27in/4 
= 22712 ein + 3in/4 
= 277/3 (cos 3/4 + isin 37/4) 
= 215(—1 + i). 
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Example (ii) Find cos 40 and sin 40 in terms of powers of cos 6 and 
sin 6 respectively. 


Solution Since (cos 8 + isin θ)" = cos 46 + isin 40 then 


cos 49 + isin 40 = cos* θ + 4ϊ οοβ 6 sin 6 
— 6 cos? 6 sin? 6 — 4i cos θ sin® 8 + sin* 6 
after using the binomial expansion. 
Equating real and imaginary parts (for equality of complex numbers), 
cos 40 = cos* 6 — 6 cos? 6 sin? 6 + sin* 0 

= 1 — 8cos? 6 + 8cos* 9 
and sin 40 = 4 cos° sin 0 — 4 cos θ sin® 6 

= 4sin θ cos 6 (cos? 6 — sin? 6). 


10.5.1 The ath roots of unity 


From equation (26) of section 10.4, with 0 = 0, 1 = e?!*” where k is 
an integer k = 0, 1, 2, 3,. . . . Suppose the nth roots are denoted by 
Zo, Z15 Zan + + © then 


2, = 4/1 = (e2tka)tin 
= ettkrin from equation (32), 
= cos (2km/n) + isin (2kz/n) (34) 
The expression (34) has n distinct values with argument in the range 


(--π, +7), for any ἢ successive values of the integer k. For k = 0, 1, 2, 
. . «, (n — 1) the successive roots are 


Zee = 1; z, metres 2 Ἐξ etitin: |. τὸ 
Zan = e2(n-l)imin — @- 2trin, 
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The other values of k give values of z, which simply repeat those already 
found, for example kK =n and k=n+1 give z, = e@!nm/n — @2tr 
= 1 = Zp and 2,41 = eX" + t7/m τς elm etimin —. e2ivin — 7 Thus it 
will be seen that there are 7 values for the nth roots of unity, no more 
and no less. Put another way, it follows that the equation z* = 1 has 
n distinct roots when n is a positive integer. If n is even, two of these 
roots are the real roots +1 and the rest are all complex; if n is odd 
then one of these roots is real, 1.6. +1, and the rest are complex. 

The roots given by (34) may be conveniently illustrated by plotting 
on the Argand diagram. The first root is the point 1 + i0 and the others 
are distributed on the circumference of the unit circle, (i.e. the circle 
of radius 1, centre z = 0) at equal angular intervals of 27/n as shown 
in Fig. 10.6. 


Example (i) Find the fourth roots of unity. 


Solution Here, 
Ze = Ψ1 = le2tkr/4 — Jetkmi2 


and the roots may be represented in abbreviated form by 
1 0; 1 {π|2; 1 [: 1 [32/2 
or 1+; O+7; —1+i0; and 0 --ὶ, 
Example (ii) Find the fifth roots of unity. 
Solution The roots are given by 
2 = le**ri5, k = 0,1, 2,3,4 
and may be written 
1/0 = 1; 1{2π|5 = 125: 1/144°; 1/216°; 1/288°. 


These roots are plotted on the Argand diagram in Fig. 10.7. 


Fig. 10.7 
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10.5.2 The nth roots of a complex number 
If a complex number is expressed in the Euler form then the roots may 
be found by the method of section 10.5.1. Suppose a = re” or 
rei(@+2k7) where r = [4] and 6 = arg a and k is an integer, then 

a/a = ail” 

a (γ1|π)ς (9 Ὁ 2ka)In, 

The roots may then be represented in the form ({Ψ)(0 + 2k7)/n. In 
the Argand diagram the roots all lie on the circle of radius 7/r, the first 
root (for k = 0) has the argument (@/n) and the other (” — 1) roots 
are distributed symmetrically around this circle at angular intervals of 
(27/n). 
Example Find the cube roots of —5 + 2i. 


Solution Write 
—5 + 2i = (29) cis 6 


where tan 9 = — ξ 
and θ = 180° — 21° 48’ 
= 158° 12’. 


The three cube roots are then 
Ζις = 298 cis (158° 12’ + k360°)/3, k =0,1,2 
= 298 cis (52° 44’ + k120°) 
= 1-753/52° 44’ + 120° 
ie. 1-75/52° 44’; 1-75/172° 44’; 1-75/292° 44’. 


Fig. 10.8 


In this form the roots may be conveniently plotted on the Argand 
diagram and in Fig. 10.8 they are all seen to lie on a circle of radius 
1-75. The roots may also be written 

1-75(0-6055 + 10-7959); 1-75(—0-9919 + i0-1265) 
and 1-75(0-3864 — 10-9223). 
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10.5.3 The roots of an algebraic equation 


The roots found in section 10.5.2 are the roots of the equation z* = a. 
It is seen that there are n roots. It is a fundamental result in complex 
algebra that any equation of the form 


f@ =2" + bz") 4+ δ. 2 +...456,=0 (35) 


where n is a positive integer, has exactly n roots, any multiple root 
being counted the requisite number of times. The equation z* = a is 
simply a particular case of (35). 

Suppose that z = « + if is a root of (35) then 

S@ ΞΞ Κα + ip) = (« + if)" + διί(α + if)"-24+...4+5,=0 


and, providing that the 5, are real, evaluation of the binomial expres- 
sions leads to 


φία, B) + ip(a, B) = 0 (36) 
where ¢ and y are known functions. 
Equation (36) is satisfied by 

(a, B) = 0 = (a, B) 


and hence it follows that 


φία, B) =~ ip(«, B) =0. (37) 
Equation (37) shows that « — if is also a root of 
Κῶ =0 


and that complex roots of an algebraic equation with real coefficients 
occur in conjugate pairs. This result can be usefully employed when 
solving such equations. 


Example (i) The roots of the quadratic equation ax? + bx +c =0 
with 4ac — b? < 0 are given by 

x=a+if and x=a—if 
where « = —b/2a and B = +/(4ac — b?)/2a. 


Example (ii) Solve the equation 
z* — 42? + 8z + 35=0 
given that 2 + i4/3 is one root. 
Solution Two of the roots are 2 + i4/3 and 2 — ix/3 and hence 


( -- 2 + in/3)(z — 2 — ir/3) is a factor of z* — 4z? + 82 + 35, ie, 
Ζῇ — 42 + 7is a factor. 
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By inspection, 

(22 — 42 + 7)(23 + 424+ 5) =0 
and the other two roots are given by 

z=—24V7(4—5)= --2 47. 


It is important to note that the proviso that the coefficients ὃ, are 
real is necessary for the derivation of (37). When the δ, are complex the 
roots of (35), while still x in number, do not occur in complex conjugate 
pairs. For example, the roots of 


2274+ 2iz+3=0 
are given by z = iand z = —3i. 
10.66 TRIGONOMETRIC AND HYPERBOLIC FUNCTIONS 
From equation (23) of section 10.4 
e® = cos # + isin 6 
and e~ ” = cos 0 — isin 6. 


Adding and subtracting leads to 


18 — 10 
cos 6 = a (38) 
10 _ a 18 
and sin @ = " ---. (39) 
i 
Recalling the definition of the hyperbolic functions of section 4.5.1 
4 e-¢ 
cosh Φ = oe (40) 
o . e-¢ 
and sinh φ = —~ (41) 
and replacing ¢ by ἰθ gives two important relationships, 
ἐθ ἡ θ' 
cosh ἰθ = se = cos 0 
e!? — e- 0 : (42) 
and sinh i6 = ge isin 6 


Similarly, replacing 6 in equations (38) and (39) by i6 gives two other 
relationships 


-6 6 
: — = cosh @ 


e-9 — ef na ᾿ (43) 
5] = isinh 0 


cos ἰθ = 


sin 10 = 
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To each trigonometric identity in 6 there corresponds an equivalent 
hyperbolic identity obtained by replacing 6 by 10 and using equations 
(43). 


Example (i) cos(A + B) = cos A cos B — sin A sin B 
and so cos (iA + iB) = cos iA cos iB — sin iA sin iB. 
Using equations (43), 
cosh (A + B) = cosh A cosh B — i? sinh A sinh B 
= cosh A cosh B + sinh A sinh B. 
Example (ii) Since 
sin 26 = 2 sin 6 cos 6 

then, using equations (43), 

isinh 20 = 23 sinh θ cosh 6 
or sinh 26 = 2 sinh 6 cosh 6. 


10.7 FUNCTIONS OF A COMPLEX VARIABLE 


The complex quantity z or x + iy has, so far, been considered fixed, 
e.g. as 3 + 2i, and represented on the Argand diagram by a fixed point. 
If, however, x and y are permitted to take on different values then z 
will take up different positions in the z-plane and is then called a 
complex variable. 

For example, if z = x + iy and |z| = 1 then z is represented by any 
point on the circle of unit radius, centre the origin, i.e. on the unit 
circle on which x and y vary according to the rule x? + y? = 1. 

Instead of z being constrained to lie on some particular curve of the 
z-plane it is often required that z takes a position interior (or exterior) 
to a curve, 1.6. a position within some region of the z-plane. For 
example, if z = x + iy and |z| <1 then the complex variable z is 
represented by any point interior to the unit circle x? + y? = 1 of the 
Argand diagram. 

Any expression that contains the variable z, such as z?, 1/z, εἴ and 
takes a single value for any given value of z is called a function of the 
complex variable z. This function is usually abbreviated to f(z) or 


w(z), 1.6. 
w = fz) = f(« + iy). (44) 
Example (i) If w = z? then 
w = (x + iy)? 
= x? — y? + 2ϊχν. 
Example (ii) If w = 1/z then 
w = 1/(x + iy) 


= (x — iy)/(x? + y”). 
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These two examples show that a function of a complex variable is itself 
a complex variable, i.e. 


w=u+iv = f(x + iy) (45) 
and u = u(x, y), v = v(x, y). 


10.7.1 Mapping function 

If z is represented by a point on the Argand diagram then w will be 
represented by another point of the same Argand diagram. For example 
if z= 1 +i then the result above shows that w = z? is represented 
by the point 0 + 27. However, it is usual to indicate the point z on 
one Argand diagram, i.e. the z-plane, and to indicate the correspond- 
ing point w on a second Argand diagram, i.e. the w-plane with axes of 
u and v. To each point of the z-plane there corresponds one point on 
the w-plane. As z covers all points on some curve 1.1 or within some 
region R, of the z-plane, the points w cover corresponding points on 
some curve L, or some region R, of the w-plane. The functional rela- 
tionship w = f(z) is said to map or to transform the curve L,, or the 
region R,, into the curve L2, or the region R,, and is called a mapping 
or transformation function. 


Example (i) Consider the mapping function w = z? and the imaginary 
axis x =0,0<y< ©. 


Solution The curve Ly, the line x = 0, 0 < y < οὐ, is mapped into the 
curve L, given by 


u + iv = (x? — y?) + 2ixy, with x =0,0<y< οὦ 
or 
u=—y?; v=0; 0<y< +o. 


Thus, as z moves outwards from the origin along the positive imaginary 
axis, w moves outwards along the negative real axis. 


Example (ii) Examine the region into which the first quadrant of the 
z-plane is mapped by the function w = z?, 


Solution Let z = re® and w = pe’, 


then pe’? = γ3οϑιθ 
= 7? 
ie. . . 
$= 29] 


Figure 10.9 shows the region of the mapping, the first quadrant of the 
z-plane is specified by 0 < 6 < π|2,0 <r < o and the corresponding 
region of the w-plane by 0 - ὁ <a, 0 <p < oo. The region of the 
w-plane is the upper half of the w-plane. 


Example (iii) Find the region into which the semi-infinite strip 
0<y <7, —0 <x <0 of the z-plane is mapped by the mapping 
function w = e?. 
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Solution In this case 
w = pel? = e? = et + 
p=e 
aed 
In Fig. 10.10 the semi-infinite strip is represented by the letters A, B, 
Cc, D. 


Along AB; y=0, -m<x<0 
hence @=0, 0<p<il, 
which is represented by A,B, in the w-plane. 


1.6. 


Ζ - plane w plane 
Fig. 10.9 


Ζ - plane 
Fig. 10.10 
Along BC; x=0, O<y<r 
hence p=1, 0<¢<z, 
which is represented by the arc B,C. 
Along CD; yer, 0>x>~-o0 
hence φτεπ, 1>p>0 


which is represented by C,D,. 
At an interior point of the semi-infinite strip, x <0 and hence 
pee<l. 
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Thus the interior of the semi-infinite strip 0 <y <7, ~o<x<0 
is mapped into the region 0 <¢ <7,0<p <1, ie. into the interior 
of the upper half of the unit circle of the w-plane. It may be noted that 
the boundary ABCD is mapped into the boundary A,B,C,D, and that 
the points A and D representing points at infinity in the z-plane map 
into coincident points at the origin of the w-plane. 

The subject of mapping has many applications in engineering 
wherever a two-dimensional flow problem exists, notably in the fields of 
electricity, electrostatics, fluid dynamics, and elasticity. 


10.8 DIFFERENTIATION OF A COMPLEX FUNCTION 


Suppose z = x + iy to be a complex variable represented by the point 
P of the Argand diagram and let z’ be the value taken by z at an adjacent 
point Q as shown in Fig. 10.11. Consider the limiting value of the ratio 


fle) ~ fa) ae 


Ζ' -- Ζ 


as Ζ' —» Ζ, where f(z) is a function of the complex variable z. 


Fig, 10.11 


There is no restriction on the way in which z’ approaches z, that is, 
no restriction on the path taken by Q as it approaches towards the 
point P (a number of different paths are shown in Fig. 10.11) and it 
would be expected that the limit of (46) (assuming it to exist) would be 
different for different paths. In general this is true but it is a remarkable 
fact that there exists a class of functions for which the limit of (46) is 
unique, i.e. is the same for all paths taken from Q to P. Such functions 
are called analytic functions and it is not altogether surprising that the 
theory of analytic functions has been extensively studied and has many 
applications in both theoretical and practical engineering. 


10.8.1 The derivative of f(z) 
When the limit of (46) exists and is unique it is denoted by /’(z) or 
dfjdz and is called the derivative of f(z) with respect to z. 


COMPLEX NUMBERS AND COMPLEX VARIABLES 267 
Thus, 


Iz Apa Az 

where Az = z’ — Ζ. 

Certain conditions must be satisfied by f(z) in order that the limit in 
equation (47) is unique. Let f(z) = u(x, y) + iv(x, y), then equation 


(47) may be written 


Sf ey τς PU + Ax, y + Ay) — u(x, y) 
Ὡς ΞΘ gee [5 5051 Ὁ) te) 
yo 
. , [ (x + Ax, y + Ay) — v(x, y) 
+m [EME ADH 
yo 


It is possible to examine the limiting process as z’ > z in a general 
manner but it is sufficient to consider two particular cases. For 
the first let Ay = 0 (i.e. let 2’ — z be real) and for the second let 
Ax = 0 (ie. let Ζ' — z be imaginary). When Ay = 0, equation (48) 
gives 


i . [π|χ + Ax, y) — u(x, y) 
f@) = Lim τ: 959 - 9} 
. U(x + Ax, y) — v(x, 5] 
+ i Lim a Κ' κ΄ ΑΈΒΕΙΘΟΝ ὡς, ἐκ.» ἀδ. 
0 Ax 
ou =, Ov 
= ax t Fay’ (49) 


and when Ax = 0, equation (48) gives 


ry — pa, [UY + Ay) — ulx, γ) 
F@) = Lim [MPD 
[x,y + Ay) — v(x, 2) 
sear [ Ay 
_ ou, ὃ 
i ὃν éy 
1.6. 
Γ ov Ou 
LO= 5-43 (50) 


it being assumed that the partial derivatives exist. The condition that 
J (2) has the same value in equations (49) and (50) leads to 


Gus @v ou ov 
Ox = ay and ὃν ΞΞ- ὃχ (51) 


showing that u and v, the real and imaginary parts of f(z), are connected 
by two partial differential equations. These equations are called the 
Cauchy-Riemann equations. 
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Because the equations (51) have been obtained by considering two 
particular limiting processes it is only to be expected that these are 
necessary conditions, i.e. if a function is analytic then the conditions 
(51) are satisfied. However, it may be stated, without proof, that pro- 
viding the four partial derivatives exist throughout a region and are 
continuous at all points of the region then the conditions (51) are also 
sufficient, i.e. if the continuous derivatives satisfy (51) then the function 
f(z) is analytic. 


Example (i) If f(z) = 2? then u = x? — y*, v = 2xy and 


ou ov 

ax = 2x, ay = 2x 
ou ov 

ὃ 3 ag = 


The derivatives exist and are continuous everywhere, the Cauchy- 
Riemann conditions are satisfied everywhere and hence f(z) = z? is 
analytic everywhere. 


Example (ii) If f(z) = 1/z then u = x/(x? + y?), v = —y/(x? + y*) and 
dus yp? — x? ὃν γῇ -- ? 


ππτα ΞΞΞ -ττιτι-- -. -- -Ξ------- 


The derivatives exist and are continuous everywhere except at the point 
x = 0 = y. The Cauchy-Riemann conditions are satisfied everywhere 
except at this point. The function 1/z is therefore analytic everywhere 
except at the point z = 0. 


Example (iii) If f(z) = Ζ = x — iy then u = x, v = —y and 
ou θυ 
ὃς. Ξ a = 
ou ov 
ye 0, oo 0. 


The derivatives exist and are continuous everywhere but the Cauchy- 
Riemann conditions are not true anywhere. The function Ζ is not 


analytic. 


Example (iv) If f(z) = \z|? = x? + y? then u = x7 + γ᾽, v = 0 and 


ou ev 
a 2» yee 
ou ὃυ 
By 7 ay = 9 
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The derivatives exist and are continuous everywhere but the Cauchy- 
Riemann conditions are satisfied only at the origin x = 0 = y. The 
function |z|? is analytic only at the point z = 0. 


10.8.2 Harmonic functions 

If a function f(z) is analytic within a region it is an important fact that 
f(z) then has derivatives of all orders. The proof of this statement is 
beyond the scope of this book but, in particular, the second order 
derivatives exist and hence from (51), 


o7u _ ὃϑυ ὃϑυ = e?u 
ax? Ox dy Oy ax 7 ay? 
: eu Ou 
and similarly 
a2n ὃϑυ 
a + ὃγξ = 0. (53) 


The functions u and v and hence u + iv are solutions of the second 
order partial differential equation 

eV aV 

Ox? + oF = 0. (54) 


The equation (54) is the two-dimensional form of a famous equation 
called Laplace’s equation and any solution is called an harmonic function. 
When u + iv is analytic the two functions u and v are called harmonic 
conjugate functions (note that this is a different use of the word conju- 
gate to that used in section 10.2.3). 

Many problems in engineering can be reduced to the mathematical 
problem of finding a function that satisfies the two-dimensional 
Laplace equation within some region and satisfies certain conditions 
on the boundary of the region. In view of equations (52) and (53) it 
may be anticipated that the solution of such problems can be facilitated 
by the use of complex variable analysis. 


Using the notation of Section 9.12 


δ ὃ. 
νὰ". 
δ. 8. (ὃν, av. 
ον , ay 
= a2 ὁ ay 


Following the usual vector notation V.V is written [72 and hence 
Laplace’s equation has the alternative operational form 


γῆν =0. (55) 
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Example (i) Show that the function u = cosh 2x cos 2y is harmonic. 
Use the Cauchy-Riemann equations to deduce the harmonic conjugate 
function v that makes w(x, y) = u(x, y) + iv(x, y) an analytic function 
and express w as a function of the complex variable z = x + iy. 


Solution Differentiating, 


ze = 4 cosh 2x cos 2y; 53 = --α cosh 2x cos 2») 
and hence 7?u = me + a = 0 and yu is an harmonic function. Now, 
from the first equation (i) 
5 - 5: Ξ 2 sinh 2x cos 2y 
and v = sinh 2x sin 2y + g(x) 


where g(x) is an arbitrary function of x only. The second equation (51), 
ov 


Ou 
ὃς -- ὃν leads to 


2 cosh 2x sin 2y + g’(x) = 2 cosh 2x sin 2y 
and it follows that 


g(x) =0 
1.6. g(x) = C = constant. 
Thus v = sinh 2x sin 2) + C 


where C is an arbitrary constant. 
The analytic function w = u + iv is given by 
w == cosh 2x cos 2y + isinh 2x sin 2y + iC 
= cosh (2x + i2y) + iC 

1.6. w = cosh (22) + iC 
Example (ii) Show that u = γῇ — 2y — 3x°y and v = x° + 2x — 3xy? 
+ C are harmonic conjugate functions. 

Express w = u + ivin the form f(z) where z = x + iy and find 
C ΚΟ = 1. 
Solution Differentiating, 

a?u/dx? = —6y; 0?u/dy? = by 

and 2v/8x? = 6x; 07v/dy? = —6x " Pe 
and hence 72u = 0, V2v = 0, and u, v, are harmonic functions. 
Also eujex = —6xy = av/dy 
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and 
duley = 3y? — 2 — 3x? = —(—3y? + 2 + 3x?) = --δυ δχ 
and u, v, are harmonic conjugate functions, 
Further, w =u + iv 

= γῆ — 2y — 3x?y + ix® + 2ix — 3ixy? + iC 

= f(x + iy). 
Setting y = 0 in the last result gives 

SL) = ix® + 2ix + iC 
and so {{2) = iz* + 212 + iC. 
With f(0) = 1, then iC = 1, ie. C = —i, and finally 
w = f(z) = i(z? + 22) - 1. 

EXERCISE 10 


1 Express the following complex numbers in the form a + ib, where 
a and ὃ are real: 


(i) ic + id), (ὃ (1 + δ — 34), 
(iii) (1 + 22)/(1 — ἢ, (iv) (ἃ + ΚΙ — ἢ, 
| (v) 1/1 + 2), (vi) (1 +B + 21/2 — ἢ. 


form a + ib, of (i) z*; (1) ([ + 2)/(1 — 2); iii) z + 1/2z. 

3 In the relationship (R + ipL)(S — i/pC) = P/Q all the quantities 
are real except i. Show that p = 4/(R/LSC) and find R in terms of 
C, L, P, Q, and S. 


4 Express in the form re”, giving the principal value οὔθ: 
(ἢ +3; ὦ -3; Gi) 1+i; GY i-1; () V3/2—-i/2; 
(vi) —8¢V3-D; (ΟἿ BG+H)/Qi-—1}; (Will) ( Ὁ ἢ; 
(ix) (1 + e')(1 + e9'7/2), 

5 Simplify 2/(cos 7/4 + isina/4); (4/3 — i). 

6 Solve the equation (x? + 1)? = 4(2x — 1) given that 27 — 1 is one 
root. 


| 2 Given that z = x + iy, find the value, in terms of x and y, and in the 


7 Find all the solutions of z® + 1 = 0 and indicate their positions on 
the Argand diagram. 


8 Find the cube roots of 8(i4/3 — 1). 
9 Solve the equation z° + 1 = 0 and hence deduce that 
Zz + 1 = (z + 1)(z? — 22 cos 7/5 + 1)(z? — 22 cos 3π|5 + 1). 


10 Use de Moivre’s theorem to show that 
cos 50 = 16 cos® 6 — 20 cos? @ + 5cos @ 
sin 50 = 16 sin® 6 — 20 sin? 6 + 5 sin @. 
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11 Writing 2 = cos? + isin show that 2cosn@ =z" +27" and 
2isinn@? =z" —z~" where n is any integer. By expanding 
(z — 1/z)*(z + 1/z)* deduce that 


28 sin* 6 cos? 6 = cos 79 — cos 56 — 3 cos 36 + 3 cos 8. 


12 Derive the following identities from the equivalent trigonometrical 
identities 
cosh? Φ — sinh? ¢ = 1, 
cosh? Φ + sinh? ¢ = cosh 2¢ 
= 2 cosh? ὁ — 1 
= 1 + 2 sinh? φ, 
tanh 26 = 2 tanh f/(1 + tanh? ¢). 
13 Given that 
cos (x + iy) = cos 6 + isin 6 
where x, y, and 6 are real prove that 
cos 2x + cosh 2y = 2. 


14 Show that the solutions of cos z = — 3 are the solutions of the 
simultaneous equations 
cos x cosh y = — 3 


sin x sinh y = 0. 
Hence show that the solutions are 
z= (2r + 17 + icosh™ 3. 
15 Describe geometrically the path or region of the z-plane if 
(i) Rez=2 (ii) Im(z +iz)=0 (ii) 0 < argz < 1/3 
(iv) [2] <1 (v) |z — 3] =2 (vi) |z + 37: 4 
(vii) 27 = 2 (viii) |z/(z — Ὁ] = V2 
(ix) 1 < |z| <2, 7/4 < argz -Ξ- πί2. 
16 Test whether the following functions are analytic and find f’(z) 
where possible, stating any restriction on z. 
(i) f(z) = χ + y? — 2χν (ὃ f(z) = 2x — 3y + 13x + 2y) 
ii) f(z) = sin (1/z) (iv) f(z) = xe’. 
17 Write the analytic function f(z) in the form u(r, 6) + iv(r, 8) where 


z= re’, and from the Cauchy-Riemann equations in Cartesian 
form deduce the equations in polar form 


ou 1 ὃ Lowe 
arog ~ : 
18 Determine value for a, b, c, and d if 
(x? + axy + by) + i(cx? + dxy + y?) 
is analytic everywhere. 


COMPLEX NUMBERS AND COMPLEX VARIABLES 273 


19 Investigate the paths into which the real and imaginary axes of the 
z-plane are mapped under the transformation w = (z + 1)?. What 
region in the z-plane is transformed into the semicircle u? + v? <1, 
v > Oin the w-plane? 


20 A, B, and C are the points 1, 2, (2 + i) respectively, in the z-plane. 
Find the paths in the w-plane which correspond to the straight lines 
AB, BC, and AC when w = 1 + 2/z. Sketch on the Argand dia- 
gram. (Hint. For AC, evaluate u + v and (u — 1)? + v?.) 


21 When w = e? show that the straight line x = ky, y > 0 in the 
z-plane is transformed into an equiangular spiral in the w-plane. 


22 Show that u = y® — 3x?y is a harmonic function and use the 
Cauchy-Riemann equations to find the harmonic conjugate function 
v where w = u + iv is analytic. Express w as a function of z. 


23 Use the Cauchy-Riemann equations to show that if w= u-+ iv 
= f(z) is analytic then each member of the family of curves 
u(x, y) = constant cuts each member of the family of curves 
v(x, y) = constant orthogonally. Illustrate by sketching the families 
of curves given by the transformation 


(i) w= 27; (ii) w=1/z; (δ z = sinw. 


24 Show that u = x + 3x*y — y® is an harmonic function. Find the 
harmonic conjugate function v such that w = τὶ + iv is analytic. 
Express w in the form f(z) where z = x + iy. 


1 
Ordinary Differential Equations 


11.0 INTRODUCTION 
An ordinary differential equation is a relation connecting one inde- 
pendent variable say x, one dependent variable say y, and differential 
coefficients of y with respect to x. Many relationships in engineering 
are most easily expressed in the form of a differential equation. For 
example dg/dt = —kq is the differential equation formed from the 
problem of a leaky condenser discharging at a rate proportional to 
the charge 4. 

The order of a differential equation is the order of its highest deriva- 
tive. The degree of a differential equation is the power of its highest 
order derivative. For example 


αν 2 d 3 
(a2) Ξ 42° 


is of order 2 and degree 2. 

The solution of a differential equation is obtained by integration. 
The equation is solved when y is expressed in terms of x by a relation- 
ship of the form y = f(x) or ¢(x, y) = 0. A differential equation of 
order n will in general have a solution involving n arbitrary constants. 
This solution is called the general solution. If boundary conditions 
are prescribed, these arbitrary constants can be specified. 

The differential equation 


dy 
ὡς LM), or = Dy = f(x) (1) 
has been solved, in Chapter 6, by integration, to give 
y = | fo dx + Ὁ. (2) 


Similarly the equation 
d"y 
τς = Se) 3) 
may be solved by integrating n times giving 
Y= Co + Cx + Cox? +0 + Cai x™ 1 + F(x) (4) 


where Cy, Cy, Cg, . . ., Ca-1 are arbitrary constants and F,(x) is the 
result of integrating f(x) n times. 
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11.1 FIRST ORDER EQUATIONS 
11.1.1 One variable absent 


(i) If the equation involves dy/dx and x only, i.e. ΕΞ ay = f(x), then 
integrating both sides 


y = | fo) de + σ. 


ὦ 
Example Solve x® Ξ = 2x? + 3. 


Solution Divide by x°, x 40 


3 
then y = 2 log |x| ace C, 


where C is an arbitrary constant. 


(ii) If the equation involves dy/dx and y only, i.e. dy/dx = g(y), 
g(y) 4 0, then 


τς ape 
ἄν 8) 
and integrating both sides 
dy 
χε] -Ξ τ Ἐς 
P J 80) 
Example Solve > = ay, a given non-zero constant. (5) 


Solution Rearranging and integrating, 


x= Z+C 
=< log|y| + C 
or log |y| = ax — aC 


and this is one form of solution of (5), where C is an arbitrary constant. 
Now 


y = eat ασ 
== e% e- ac 
or y = Ae™, (6) 


where A = ο΄ °° is also an arbitrary constant. 
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When a logarithm appears in an integral it is often more convenient 
to write the arbitrary constant in logarithmic form. Thus 


ax = log |y| + log B 
then e* = By 
or y Ξ 4: A=1/B. 
The solution y = Ae is well known as the solution of equation (5), 


since οὗ is defined in Chapter 4 as the particular solution of dy/dx = y 
such that when x = 0, y = 1. 


11.1.2 Variables separable 
Equations of this type are of the form 


d 
40) > = flx) (7) 


or 80) dy = f(x) dx, 
i.e. the variables x and y may be separated. Then the complete solution 
is given by 

[sora = [κα το Θ 


Example (i) Solve tan x(dy/dx) = cot γ. 
Solution Re-write the equation as tan y dy = cot x dx, then 
sandy = [Pate 
or —log |cos y| = log |sin x| — log ς, 
or cos ysinx =c, 
1 τὰ arbitrary constant is taken in the most convenient form, 
= —loge. 


Example (ii) A mass m falls vertically from rest from a distance h 
above the ground. Determine the velocity with which it hits the ground 
if the air resistance is proportional to the square of the velocity. 


Solution Let the distance x be measured as positive downwards. 
Velocity v = dx/dt. 


Acceleration = ee dee ie (9) 
From Newton’s Second Law of motion for constant mass, 
dv 


mg — mkv? = mv --- 


ax 


where the constant of proportionality is mk. 
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Hence | <= pada [ἀπὸ 

or — splog |g — Κυὶ =x Ἐ 6. 

The initial condition x = 0, v = 0 determine C = (—log g)/2k so that 
x= Zr log a . 

Hence em'* = gi(g — kv’), 

and p? = ἔᾳι — ὁπ 3) 


gives the velocity at distance x from the start. 
In particular, when x = ἢ, 


v= ἔχ — e-™), 


Note that when h -- οὐ, v? > g/k. That is, no matter from what height 
the mass falls, the velocity will never exceed +/(g/k). 


11.1.3 Homogeneous equations 
These are equations of the form 


and are reduced to the form of equation (7) in which the variables are 
separable, by the substitution u = y/x. If y = ux, then 


and substituting for y and dy/dx in equation (10), leads to the equation 
du 
u+x Fe fl). 


In this equation the variables separate. 


Example (i) Solve the differential equation 
d γ 
xo =yt+ylog (2). 


Solution Write y = ux, dy/dx =u-+ x duldx, then the equation 
becomes 


du 
x? Fi log u. 
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If x 40, divide by x and separate the variables so that the equation 
now is 


Integrating, log log |u| = log |x| + logc, or logu=cx. That is, 
u = e and the solution of the equation is 


y= xe™, 


where c is an arbitrary constant. 


Example (ii) Solve the differential equation 
Sia dae 


dx x+y 
Solution Writing y = ux, dy/dx = u + x duldx gives 
u+x ἴω eae 


dx 1+u 


and hence 
du_ 9— uv? 
χα ἢ +u’ 


Separating the variables, 


ax (1 + ὦ du [ 2 4 | a 


x J@-wHG4H JB—u 344 
Integrating, 
log x = —$ log 3 — τὸ — flog (3 + τὸ + F loge, 
where c is an arbitrary constant. Hence 
x33 — u)?3+u)=c, 
or x9(3 — y/x)?3 + y/x) =, 
or (3x — y)*3x + y) =c. 


Example (iii) Solve the differential equation 
Gy eae 
dx x+y+1- 


Solution This equation can be made homogeneous by changing the 
variables to X and Y where Y= x-+y-+1 and Y=9x+y+2. 
Then dX = dx + ἂν, ΑΥ̓͂ = 9dx + dy and 


ὅρα ὦ ἢ 
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and the equation becomes 
ΑΥ̓͂ 9X+Y 
dX X+Y 
which is solved in example (ii). Hence its solution is 
(3X — YBX+ Y)=c, 
and the required solution is 
(6x — 2y — 1) 12χ + ἀν + 5) =, 


where c is an arbitrary constant. 


11.1.4 Exact equations 
An ordinary differential equation which can be written in the form 
M(x, y) dx + N(x, y) dy = dF(x, y), (11) 


such that the left side is an exact differential, is an exact equation. 
That is, there exists a function G(x, y) such that 


8G 8G 
δ» By =N. (12) 
Hence (11) may be written 
8 
se dx ree ay aye dF(x, y), 
or 
dG = dF. 
Integrating, 


G(x, y) = F(x, y) + C. 


Differentiating equations (12) partially with respect to y and x respec- 
tively, leads to 


and a necessary condition for exactness in (11) is 


ΟΜ aN 
ὃν x a 
That is (11) cannot be exact unless equation (13) is true. It can be 
proved that the equation is always exact when equation (13) is true, 
although the proof is beyond the scope of this book, and equation 
(13) is both a necessary and sufficient condition that (11) is exact. 
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Example (i) Solve y?dx + 2xy dy = εὖ dx 
Solution Re-write the equation as 

a(xy*) = de’), 


and integrating 
xy=e’+C 


where C is an arbitrary constant. 


It can be proved that the left side of (11) can always be made exact 
by multiplication by a suitable integrating factor p(x, y), which has the 
property that 


Flow, YMG 91 τ ἐς foGe, YING I 

It is not easy to determine p from this equation. However, certain 
combinations of differentials may be recognised. 
Example (ii) Solve x dy — y dx = xy?dx. 
Solution Divide by y? and transpose to give 

ydx —xdy 

y? 

1.6. the integrating factor is 1/y? so that 


x 
46) xdx =0. 
y + 


= —x dx, 


Integration gives 


where C is an arbitrary constant. 


11.1.5 First order linear equations 
The differential equation is of the type 


dy 

+ Plxly = OC) (14) 
First, let P(x) = a, a a given constant, so that 

dy 

ie t Ξ 2) (15) 


The left side is not an exact differential but can be made so by multiply- 

ing by an integrating factor. If Q(x) = 0, equation (15) has the solution | 
y = Ae~™, or ye* = A. 1.6. d(ye**) = 0. This suggests that the left 
side of equation (15) can be made an exact differential by multiplication 
by e. 
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Multiplying both sides of equation (15) by 665 (40) gives 


d 
et = + ae**y = οὐ: O(x) 
d 
or Ζ: (652 = e* Q(x). 
Integrating, 


γε = | e7O(x) dx +C 


or y=e -* fer*O(x) ax + Ce7? (16) 


where C is an arbitrary constant. 


Example Solve 
a 
dt 


with g = 0 at ¢ = 0, R, C, and V are given constants. Note that C is 
not an arbitrary constant. 


R +fav 


Solution The equation may be written 


dq, 4 


4.4 Υ 
at RC™R 


Multiply by e'/*° to give 
5 (get!®°) = ene 

Integrating, getRC = VCetlFo Ἐκ 

where & is an arbitrary constant. 

Hence 4 = VC + ke "το, 

The initial condition Φ = 0, t = 0 give the value of k = —VC and 
4 = VC(1 — ε΄ #9), 


Secondly, consider the more general equation (14). This can be 
made exact by multiplication by an integrating factor z(x) to be 
determined. For let P(x) = dz/dx, or z = | P(x) dx, then equation (14) 


becomes 
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Multiplication by e* gives 
d 
+ (6) = OC), 
from which on integration, 
ey = ferowy dx+C (17) 
where C is an arbitrary constant. 
Since Zi P(x) dx 
is known, y is given in terms of x. Hence, multiplying (14) by the 
integrating factor exp ({Pdx) makes the left side of (14) exactly in- 
tegrable. 
Example (i) Solve the differential equation 
dy 
x log x τς + y = 2logx. 


Solution Divide by x log x to bring the equation to the standard form 
of equation (14) 


dy y 2 
dx ἢ xlogx x (18) 

Here, P(x) = 1/x log x, fPdx = log |log x| so that exp (fPdx) = log x. 

Multiply equation (18) by the integrating factor log x and find 


dy _y_2 
log x 5 ἘΞ-Ξ x log x, 
d 2 
ΟΓ ὡς 108 x) = 5 log x. 
Integrating, y log x = (log x)? + C 
and y = logx + C/logx 


where C is an arbitrary constant. 


dx dx 
Ἢ — = e-" sec? x —. 
Example (ii) Solve 1 + y dy ο΄ sec? x By 


Solution Multiply by dy/dx to obtain the standard form of equation 
(14), 
dy 


πο γε ε sec? x. 
ax’? 
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The integrating factor is ο΄, and hence 
d 
re (ye) = sec? x. 
It follows that ye? =tanx+C, 
or y =e *tanx + Ce? 
where C is an arbitrary constant. 


11.2 SOME SPECIAL TYPES OF SECOND ORDER 
EQUATIONS 

Some types of second order differential equations can be reduced to 

first order by a suitable change of variable. 


11.2.1 Equations with the dependent variable absent 


This type of equation involves d*y/dx?, dy/dx and x only. It is reduced 
to a first order equation by the substitutions 


S47 (19) 
so that the original equation becomes one involving dp/dx, p, and x, 


which is of the first order in p. This may be solved for p as a function 
of x and, by further integration, y obtained as a function of x. 


Example Solve the differential equation 
x55+7—=0. (20) 
Solution Using (19), equation (20) becomes 
x Φ +p=0, 
so that [2+ [Feo 


Integrating leads to 
log |p| + log |x| = log A, 


or p=Alx 
where 4 is an arbitrary constant. 
dy A 
Hence Se A i 
dx x 


and integrating once more, 
y=B+ A log |x|. 


This is the solution of equation (20) where B is a second arbitrary 
constant. 
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11.2.2 Equations with the independent variable absent 
This type of equation involves d*y/dx?, dy/dx, and y only. The substitu- 
tions used are 

de Gxt ae = dy dx =P ay ey 
The differential equation now involves p, dp/dy, and y and is of the 
first order in p. Its solution gives p in terms of y and a further integra- 


tion leads to y in terms of x. 


Example (i) Solve the differential equation 


Wad?yldx*) + (dy/dx)? = 0 (22) 
Solution The substitutions of (21) used in (22) give 
y(pap|dy) + p? = 0 (23) 
or dp|p + dyly = 0. 
Integrating, 
p= Aly, 
where Α is an arbitrary constant. Then 
p = ἁ άν = Aly 
and y? = 2Ax+B 


where B is another arbitrary constant. 


An elementary application of this method is given by the solution 
of the equation of motion of a particle moving with simple harmonic 
motion as follows in the next example. 


Example (ii) A particle of mass m moves in a straight line under a 
force which is always directed towards a fixed point in the line and 
which is proportional to the distance of the particle from the fixed 
point. Examine the motion. 


Solution Let the fixed point be the origin O and the fixed straight line 
be the x-axis. Let the force per unit mass at distance x from O be n?x 
where ἡ is some constant and let ¢ be the time, then the equation of 
motion of the particle is 


OX 9 
ma = —mn’x, 
d? 
or a —n?x, (24) 
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Using substitutions corresponding to those in (21), i.e. 
ΡΝ 
a ae Pax 
equation (24) becomes 
ap + nx =0 
Pax : 


Integrating, 4p? + 4n?x? = 4a°n?, where }a°n? is chosen to be the 
arbitrary constant. Then p? = n?(a? — x?) and 


d: 

p= ἢ = Ξιην(α — 2%) 

Integrating once more, 
sin} (χα) = ει + 8), 
or x = +asin (nt + ε). 
Since a is arbitrary there is no need for the double + sign and the 
general solution of equation (24) is 
x = asin (nt + ε). 


Thus the motion is simple harmonic with period 27/n. The maximum 
displacement on either side of O is a, which is the amplitude of the 
motion. The angle « is the phase angle. 


11.3 LINEAR DIFFERENTIAL EQUATIONS WITH 
CONSTANT COEFFICIENTS 


These equations have the general form 


ἐπ" ἀπ-1 d™-2 
ας ἘΣ Onna Gack tna ac tt ey = WH), (5) 


where do, 4:1, (5, . . ., a, are given real constants and V is a given 
function of x. 


11.3.1 The D-notation 
Let D be the operator d/dx so that Dy = dy/dx (see section 2.1.3.) then 


dy a 2 

dx? ἀχλάχ 
is written D2y, and D™ means the operator d"/dx™ where m is a positive 
integer. Then, if m and n are positive integers, 


Dt = Drtm™ = D™[D"] -- D"[D*}. 
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The operator D is an example of a linear operator which is defined as 
follows. An operator ¢ is a linear operator if 


ὦ $01 + ya) = φῦ + (2) and (26) 
(ii) d(ky) = kd(y), & constant. (27) 
It is simple to show that ¢ = D satisfies these equations. Also, 
suppose y = y, and y = y2 both satisfy the equation 
φ0) = 0, (28) 
then y = Ay, + By also satisfies equation (28), where A and B are 
arbitrary constants. For, substituting y = Ay, + By, in equation (28), 
dy) = φών: + By2) = $(Ays) + $(By2) from (26) 
= Ag(ys) + Bb(y2) from (27) 
=0, 
since both y, and y, satisfy equation (28). 
Similarly it may be shown that if y,, yo, ys, . - .» Ya are all solutions of 
equation (28) then so also is 
= Α.γι + Agye + Ag Y3 +. FAR as 
where A,, Ag, Ag, . . ., Ay are arbitrary constants. 


An important property of linear operators is that they may be 
factorised. For example, 


ἀν dy 


d (ὦ ὦ 
ΤΞ Φ- 6 τς (2+) -3 (Z+): 


which may be written as 


(ὦ -ἡ 2 9) 


Writing the operator d/dx as D leads to 
(D? — ὃ — ὃν = (Ὁ ~ 3.0 + 2)y, 
or, the operator D satisfies the equation 
D? — D—6 =(D — 3)(D + 2). 
Any polynomial P(D) in the linear operator D is itself a linear 
operator, for if 
P(D) = α,}" + a,_,D""? +... +4D+ a (29) 


where do, a1, @2,. . -, @, are constants, then P(D) satisfies equations 
(26) and (27) since each term on the right of (29) satisfies these equations. 


11.3.2 The solution of differential equations with constant coefficients 
Equation (25) may now be written 
(a,D" + a,-yD" '+...+4D+a)y = fal 


or $y) = Vix) (30) 
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where the linear operator ¢ is P(D) as defined in equation (29). 

The technique for solving equation (30) is as follows: 
(i) let γε be the complete solution containing m arbitrary constants of 
the reduced or homogeneous equation 


φ0) = 0; 
(ii) let y, be any particular integral of equation (30) so that d(y,) = 
V(x). Then the complete solution of equation (30) is 


)») 5). Ὁ» (31) 
since HY) = $(Ye + Yo) 

= φύ + (V2) 

= 0+ V(x). 

The rest of this chapter is devoted to finding y., called the comple- 
mentary function, and y, called the particular integral, for different 
forms of the function V(x). The complementary function contains the 
arbitrary constants of integration. The particular integral is any 
function, no matter how simple nor how found, which satisfies the 


differential equation. For example it is easy to see that the function 
» = x is a particular integral of the differential equation. 

dy 

aa (32) 


The complementary function is A sin (x + B) from the example (i) in 
section 11.2.2, and contains the two arbitrary constants. Hence the 
complete primitive of equation (32) is 
y =Asin(x + B)+-x. (33) 
Expanding sin (x + B), 
y =Asinxcos B+ Acosxsin B+ x; 


and since A, B are arbitrary constants, so are Acos B and Asin 8 
and the solution (33) may be written 


y =Csinx + Ecosx +x (34) 
where C, E are arbitrary constants. 


11.4 THE COMPLEMENTARY FUNCTION 
Consider the solution of the second order differential equation 


(D? + aD + b)y = 0 (35) 
with a and b given real constants. This may be written 
(D — AyD — py = 0, (36) 


where A and yu may be real and distinct, real and equal, or complex in 
which case they are conjugate, (see section 10.5.3). To solve equation 


36), | 
Coes ὦ -- wy = ux) (37) 
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where u is a function to be found. From equations (36) and (37), 
(Ὁ — Aju =0 and from equation (6) of section 11.1.1, u = Ae*, 
where A is an arbitrary constant. Substituting this value of u in equation 
(37), 

(D — μὸν = 46: (38) 


and this is a first-order equation in y solved as in section 11.1.5. Multi- 
plying equation (38) by the integrating factor e~ “* gives 


D(ye™ "Ὁ = Ae@- ὼς 
and integrating gives 
ye" = [A000 dx + B 


or y= or [Act wz dx + Be#* (39) 


where A, B are arbitrary constants. This is the general solution of (36). 
Three cases arise, depending on the nature of the roots A, μ of the 
equation 
D?+aD+b=0 
Case (i) A, wu real and distinct. 


Equation (39) becomes 


y= e#? ἢ τ ἢ e(4- Hz + Be#= 
or y = Ce** + Be* (40) 


with B and C arbitrary constants. 
Case (ii) A = μ, roots real and equal. 


Equation (39) becomes 
y =e* | Adx + Be* 
or y =e*(Ax + B). (41) 


Case (iii) 4, 4 complex. 


Let A=a+if,u~=a—if. From equation (40), 
y = Ce** + Be", since the roots are distinct, or 


y =e(E cos Bx + Fsin Bx) (42) 
with E, F real arbitrary constants. 


Example (i) Solve the equation 
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Solution Write as (D? — 5D + 6)y = 0 
or (Ὁ — 2)(D — 3)y = 0, 
and from equation (40) the solution is 

y = Ae* + Be*, 
where A and B are arbitrary constants. 
Example (ii) Solve (D? + 2D + Τὴν = 0. 
Solution The given equation may be written 

(D + 1)*y =0, 

and from (41) the solution is 

y = (Ax + Bye“? 
where A and B are arbitrary constants. 
Example (iii) Solve (D? — 2D + 5)y = 0. 


Solution The quadratic in D will not factorise to give real factors. 
Since 
D?—2D+5=(D-— 1)? +4, 


the values of A and μ are 1 + 2i. From equation (42) the solution is 
y =e7(A cos 2x + Bsin 2x), 
where A and B are arbitrary constants. 

The method of solution of the nth order differential equation 
P(D)y = 0, where P(D) is a polynomial in D of degree n, is an 
extension of the method for the second order equation. 

Example (iv) Solve (D — 2)(D — 3)?(D? + 4)y = 0. 
Solution The roots of P(D) = 0 are 2, 3, 3, +2i, and 
y = Ae** + (Bx + Che? + Ecos 2x + Fsin 2x 


where A, B, C, E, and F are arbitrary constants of integration of the 
fifth order differential equation. 


Example (v) A particle of mass m moving in a straight line is acted on 
by a restoring force which is proportional to its displacement from a 
fixed point in the line and also by a resistance which is proportional to 
the velocity. Write down the equation of motion of the particle and 
solve it. 

Solution As in example (ii) of section 11.2.2, the equation of motion 
of the particle may be written in the form 


dx 
ae ee gy et οε- μερὶ 
m mn?x — 2mk =P 
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where the last term represents the resistance proportional to the 
velocity and k is constant. Let D represent the operator d/dt, then the 
equation of motion may be written 


(D? + 2kD + n*)x =0. (43) 
The roots of the quadratic D? + 2kD + n? = 0 are 
—k + V(k? — n?). (44) 


Three possibilities must now be considered: 

(ὃ If k > 7, (kK? — πϑὸ is real, and the roots are real, distinct, and 
both negative since k > +/(k? — n?). Let them be —A and —y. The 
solution of equation (43) is then 


x = Ας 4: + Be~ +t (45) 


where A and B are arbitrary constants. 
(ii) if k = the roots (44) are each equal to —k and the solution of 
equation (43) is 

x = e-F(At + B) (46) 


where A and B are arbitrary constants. 


(iii) if k <n the roots (44) are complex and may be written —k + if 
where β = +/(n? — ΚΞ) is real. The general solution of equation (43) 
in this case is 

= ae~*' sin (Bt + «) (47) 


where a and « are arbitrary constants. 

In all three cases (45), (46), and (47) the value of x approaches zero 
as 1-- co because of the negative exponential factor in each. When 
the value of k, called the damping factor, is equal to n, the rate of 
decay of x is greatest. In this case the motion is said to be critically 
damped. When k > n the motion is heavily damped or overdamped. 
When k <n the motion is oscillatory but the factor e~*' damps the 
amplitude of the vibration which dies away as 1 — οὐ, and the period 
is increased. 

Certain measuring instruments are constructed so that the motion 
of the pointer is critically damped in order that the pointer shall 
return to its equilibrium position as quickly as possible. Sketches of 
three possible distance-time graphs are shown in Fig. 11.1. 

Damped oscillations are important also in electricity where the 
differential equation for a simple oscillatory circuit may be shown to 
be of the form of equation (43). For let φ be the charge on the plate of a 
condenser of capacity C which is connected through an inductance L 
and resistance R, as in Fig. 11.2, and let there be no external e.m.f. so 
that the algebraic sum of the voltage drops is zero. 

Let j be the current in the circuit then the two equations 


dj ἜΤ ace ΕΝ 
LatRtG=0 and j= dg/dt 


ORDINARY DIFFERENTIAL EQUATIONS 291 
give the second order equation 
4 Κα q 
ΠΣ γα (48) 
This equation may be compared with equation (43) with q instead 
of x and where 2k = R/L, n? = 1/LC, so that 


(i) if R/2L > 1/-/(LC) the charge is not oscillatory and its behaviour 
is sketched in Fig. 11.1 (a) and (b). 


Case (i) 


Case (ii) 
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Case (iii) 


{c) 
Fig. 11.1 (cont.) 


L ἢ 
-q 
R c 
4 
i 
Fig, 11.2 


(ii) if R/2L < {{Μ 1.0} the charge is oscillatory and the amplitude is 
damped, as shown in Fig. 11.1 (Ὁ. 


11.5 THE PARTICULAR INTEGRAL 
The solution of 


(D? + aD + b)y = Vix) (49) 
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is y= yo + yp, where y, is found as in Section 11.4 and y, is any 
particular integral which satisfies equation (49) and contains no 
arbitrary constants. The form of the particular integral depends on 
the form of V(x). In this section the particular integral corresponding 
to V(x) as a polynomial in x, an exponential function or as a circular 
function will be investigated. 


11.5.1 V(x) a polynomial in x 

There are several methods for finding particular integrals, one of 
which consists in assuming a particular integral of the same form as 
V(x), and a second defines an inverse operator D-1. A third method, 
adopted here, consists in finding a particular integral of a set of equa- 
tions. This method is chosen as being the simplest to apply and the 
simplest to justify. It is explained by examples. 


Example-(i) Find a particular integral of the equation 
(D? + 2D — 3)y = 9χ3. (50) 


Solution Differentiate both sides of equation (50) repeatedly until the 
right side is zero as follows: 


(D? + 2D — 3)Dy = 27x? (51) 
(D? + 2D — 3)D*y = 54x (52) 
(D? + 2D — 3)D’y = 54 (53) 
(D? + 2D — 3)Dty = 0. (54) 


From equation (54) a particular integral of the set of equations is 
given by D*y = 0, from which it follows that 


D'*4y=0, n>o. (55) 
With this particular integral, substituting from equation (55) in equa- 


tion (53), from equation (55) and (53) in equation (52) and so on, the 
following results are obtained: 


from equation (53), δὲν + 2D*y — 3D®y = 54, 


1.6. Ρὲν = —18; (56) 
similarly, using equations (55) and (56) in equation (52) leads to 
D?y = —18x — 12; (57) 
and using equations (56) and (57) in equation (51) gives 
Dy = —9x? — 12x — 14. (58) 


Finally, using equations (57) and (58) in equation (50) gives 
—18x — 12 — 18x? — 24x — 28 — 3y = 9x3, 
or Vo = —3x° — 6x? — 14x — 40/3. 
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The complete solution of equation (50) is γε + yp where 

Vo = Ae 33 + Be* 
and A, B are arbitrary constants. 


Example (ii) Find a particular integral for the equation 


(D? + δὴν = x?. (59) 
Solution Differentiate equation (59) repeatedly after having re-written 
it in the form (DDD ese (60) 
to give (D + 1)D?y = 2x, (61) 
(D + 1)D*y = 2, (62) 
(D + 1)Dty = 0. (63) 


A particular integral is D***y = 0,n > 0. 

Substituting 

in equation (62) gives Dy = 2, 

in equation (61) gives 2 + D?y = 2x or D’y = 2x — 2, 

in equation (60) gives 2x — 2 + Dy = x®, or Dy = x? — 2x + 2, 
and this is a particular integral of equation (59), from which, by 


integration, 
Vp = 4x8 — x? + 2x, 


There is no need to add an arbitrary constant of integration as this 
would be absorbed in the arbitrary constant A of the complementary 
function which is y, = A + Be-*. 


Example (ili) Solve the differential equation 


ἀν d? 
at eH 2 + 4. (64) 


Solution This is a third order equation and the methods of the previous 
sections apply. The equation is 


(Ὁ + 1)D?y = 2 + 4x (65) 
from which the complementary function is 
Yo = A+ Bx + Ce™* 
where A, B, and C are arbitrary constants. To find the particular 
integral, differentiate equation (65) four times and obtain 
(D + 1)D*y = 12x? 
(D + 1)Dty = 24x 
(D + 1I)D5y = 24 
(D + 1)D’y =0. 
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Hence a particular integral is D"**y = 0, πὶ >0. 
Substituting back, D'y = 24, 
θήν = 24x — 24, 
ὃν = 12x? — 24x + 24, 
D?y = 2 + 4x3 — 12x? + 24x — 24. 
Integrating twice, 
Dy = x* — 4x3 + 12x? — 22x, 
Vo = 1χ — x* + 4χ3 — 1x? 
The complete primitive of equation (64) is therefore 
Y=Ve + Vp = (95 + A+ Bx — 11x? + 4x3 — χ' + 1χ5.Ψ 
11.5.2 V(x) of exponential form 
Consider the second order differential equation 
(D? + aD + δὴν = e* (66) 
where a, b, and « are given constants. 
Since D(e**) = ae“? and D?(e%*) = «?e%*, 
(D? + aD + b)e* = (a? - aa + b)e**. (67) 
Hence 
2 fs AN a ee 
(D +a +d] =e 


provided «? + ax + ὃ 0, and a particular integral of equation (66) 
is e%*/(a? + ax + δ) since this function satisfies equation (66). 

More generally if the differential equation is of order n, such that 
P(D) is a polynomial of degree n, the equation 


P(D)y = e** (68) 
has a particular integral 
e%t 
Yo = Bo) (69) 


provided P(a) + 0. 
Example Solve (D? — 3D + 2)y = 20 cosh 3x. (70) 


Solution Since 20 cosh 3x = 10e%* + 106 55, there are two particular 
integrals corresponding to 10e** and 10-** respectively. 


Now P(D) = D?—3D +2, so that when « = 3, P(3)=2, and 
when « = —3, P(—3) = 20. Hence using equation (69), 


Yo = 10e*/2 + 106. 7/20. 
The complementary function derives from 
(D -- τω — 2y =0, 
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and is y, = Ae* + Be**, so that the complete solution of equation (70) 
is 


y = Ae* + Be®* + 568: + ζει 3: 
where A and B are arbitrary constants. 


Suppose now that P(x) = Ὁ. This means that the term e** on the 
right of equation (68) occurs in the complementary function. It is 
necessary under this condition to use the theorem that 


P(D)(e**y) = e*P(D + ἢν (71) 


where k is a constant. This is proved by induction as follows. Since 
P(D) is a polynomial in D it is only necessary to prove 


D*(e"y) = "(Ὁ + Ὁ. (72) 


Assume that, for some fixed arbitrary value of n, equation (72) is 
true, then 


D"**(ey) = DD*e*y) = Die(D + k)"y] 
=e D(D + k)"y + ke*(D + k)"y 
= e(D+k)"(D+ ἃν 
= εἰ: (Ὁ + k)**y, 


That is, if equation (72) is true for some fixed arbitrary value of n 
then it is true for n + 1. When = 1 equation (72) is true, since 


D(e**y) = e**Dy + key = εἰ Ὁ + k)y. 


Hence equation (72) is true for all positive integers n > 1, and thus 
equation (71) is true for all polynomials P(D). 
Let the differential equation be 


P(D)y = e** (73) 


where « is a given constant and P(«) = 0. Write y = e%*u, where u is a 
function of x to be found, and equation (73) becomes 


P(D)(e**u) = e™. 
Using the result (71) with k = α leads to 
e“P(D + a)u = e**, 
Dividing by e** (340) gives 


P(D + wu = 1, 
and u is found using the method of section 11.5.1. 
Example Solve (D — 2)(D + τὴν = e”*. (74) 


Solution The complementary function is 
Ye = 4655 + Be-*, 
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where A and 8 are arbitrary constants. To find the particular integral, 
since P(2) = 0, write y, = e?*u in equation (74), then 

(Ὁ — 2)(D + 1)(e?*u) = e 
becomes e7(D —2+2(D+1 + 2)u = e%, 
or (D + 3)Du = 1. 
Differentiating, a particular integral is D2u = 0 and all higher deriva- 
tives are zero, and u = 4x. Hence Vp = $xe™ and the complete solution 


of equation (74) is 
y =(A + 4xje? + Be-?, 


11.5.3 V(x) is cos wx or sin wx 


Since e'°* = cos wx + isin wx, it is necessary to solve P(D)y = εἰς: 
using the methods of section 11.5.2, choosing the real part of the 
solution if V(x) = cos wx and the imaginary part if V(x) = sin wx. 
For if the equation is P(D)y = R,(x) + iR(x), the solution is y = 
yi + ive where P(D)y, = R,(x) and P(D)y, = R,(x), since 


P(D){y1 + iyo} = P(D)y, + iP(D)y2 = Ri(x) + iR,(x). 
In particular, if R(x) is sin wx, the solution is Im(y). 
Example (i) Solve (Ὁ + 1)?(D + 2)y = cos 3x. 
Solution The complementary function is 
ye = (A + Bxje“* + Ce“? 

where A, B and C are arbitrary constants. 
For the particular integral, consider 

(D + ἡ + 2)y =e, 
Now P(D) = (D + 1)?(D + 2) so that 

P(3i) = (31 + 1)?(3i + 2) 

= —2(17 + δὴ. 


1 
a ————____ padiz 
Hence Vo= κε] -- χη} δὴ e | 
17 — 6 


= Re| -ὸ Te (cos 3x + isin 3%) | 


17 6 
pees heck eae : 
ΕΣ cos 3x + 395 sin 3.) 


This may be written —}cos (3χ — 0) where οοβθ = 17/325 and 
sin @ = 6/325. The complete solution is therefore 


y =(A + Bxje-* + Ce-** — τ cos (3x + 6) 


where 6 is defined above. 
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Example (ii) Solve (D? — 2D + 2)y = εἶ sin x. (75) 


Solution The complementary function is obtained from the roots of 
D? —-2D+2=0, or (D—1)?+1=0, and from case (iii) of 
section 11.4, withA=1+in=1-—-i, 


Ve = e(A cos x + Bsin x). 
To find the particular integral consider 
(D? -- 2 + 2)y = ete! -- et bz, (76) 


With the notation of section 11.5.2, « = 1 4+ i, P(D) = (Ὁ — 1)? + 1, 
Pa) =?+4+1=0. 


Hence writing y = e“**u(x) in equation (76) leads to 

(D + 2i)Du = 1, (77) 
and differentiating this once leads to 

(D + 2i)D®u = 0, 


so that a particular integral of equation (77) is D"*°u =0, n>0. 
Substituting back in equation (77) gives Du = 1/2i, and integration 
shows that 


u = x/2i = — fix 
Thus Vo = Im[—Htixe*e*] 
= Im[—}ixe* (cos x + isin x)] 
= —4}xe" COS X. 
The complete solution of section (75) is therefore 
y =e*(Acosx + Bsinx — 3x cos x) 


where A and B are arbitrary constants. 


11.5.4 V(x) is x"e** 


In this case it is necessary to write y = e**u(x) and solve as in sections 
11.5.2 or 11.5.3. 


Example Solve d?y/dx? + n®y = x cos nx. (78) 
Solution Re-write equation (78) as 
(D? + n®)y = xe™, (79) 
The complementary function is 
ye = Acos nx + Bsin nx, 


where A and B are arbitrary constants. To find the particular integral 
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write y = e*u in equation (79) and 


(D? + n*)(e'™*u) = xe int 


becomes [(Ὁ + in)? + n?]u = x 
or (D + 2in) Du = x. 
Hence (D + 2in)D?u = 1 
and (D + 2in)D®u = 0, 


and a particular integral is D’*°u = 0, n > 0. Substituting back and 
integrating leads to the value of u as 


_ χ᾽ χ 
ea 4n? 
Ἶχ2 
Then Vp = la + 75) (e0s nx + isin m)| 


x2 
=> 475 COS NX + 2, Sin nx. 
The complete primitive of equation re is, therefore 


x2 
y= A cos nx + Bein nx + <5 4,2 “05 MX + ας Sin nx, 


where A and B are arbitrary constants. 


11.6 APPLICATIONS 


Many vibrational problems in engineering such as occur in oscillating 
electrical networks, the bending of beams, torsional problems, the 
extension of springs, give rise to second order differential equations of 
the types solved in section 11.5. For example, if the electrical network 
of Fig. 11.2 has an applied voltage V(r), equation (48) is replaced by 


dq Rdg 4. ΓῺ 

dt? “ Ladt’ LC L 
When V(Z) is a polynomial in t, an exponential, or a circular function, 
this equation is solved by one of the methods of section 11.5. When 


V(t) is not of these forms further methods, not discussed here, may be 
used to find the solution. 


11.7 ISOCLINES. GRAPHICAL INTEGRATION OF dy/dx = f(x, y) 
Suppose the solution of the differential equation 

dyldx = f(x, y) (80) 
is obtainable in the form 


y = $x, ©) (81) 
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where c represents the constant of integration. The graph of y = ¢(x, c) 
for any given value of ὁ is called an integral curve. Different values of c 
give the members of the family of integral curves. 
Example (i) Sketch the family of integral curves of the differential 
equation dy/dx = 3x? — 2x. 
Solution Here pee Ae 
and the integral curves are cubic curves of the form 

γ-- ο- χα -- 1) 


and are shown in Fig. 11.3. 


Fig. 11.3 


Let such a family as (81) be represented by the curves as shown in 
Fig. 11.4. 

The Fig. 11.4 also shows other curves, such as AA’, BB’, and CC’ 
which are the loci of those points, on the integral curves, that have a 
common slope dy/dx or φ' (χ, c). Such curves are called isoclines and 
Fig. 11.4 indicates isoclines AA’, and BB’ for which ¢’(x, c) is zero and 
the isocline CC’ for which $’(x, c) is negative unity. The isoclines are 
curves whose equations are given by 


Le. S(x,y) = m. (82) 
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In Fig. 11.4, AA’, and BB’ are isoclines with m = 0 and CC’ is the 
isocline with m = —1. 

However, it is frequently the case that the integral curves (81) are 
not obtainable (i.e. that the differential equation (80) cannot be inte- 
grated) but nevertheless the isoclines (82) can be drawn. If a sufficient 
number of isoclines are drawn, together with the line elements indicating 
the slopes of the integral curves that cross the isoclines, as in Fig. 11.5, 
it should be possible to construct, or at least to give an indication of, 
the shape of the integral curve passing through any given point. 

In fact it may not always be possible to sketch the isoclines and even 
if they can be drawn the interpretation of the form of the integral 
curves may be difficult. However it usually is possible to derive some 
indication of the behaviour of the solution to the differential equation 
in the neighbourhood of a point (Xo, yo). 

Example (ii) Use isoclines to indicate the form of the integral curves 
given that dy/dx = —x]/y. 
Solution The isoclines are those curves of equation 


—x/y=m or y=(-—1/m)x 


where m is the slope of the integral curves that cut the isoclines. 
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Fig. 11.6 curve 
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The isoclines are straight lines of slope (—1/m) passing through the 
origin. Since (m)(—1/m) = —1, the integral curves must cut the iso- 
clines at right angles. The Fig. 11.6 shows some of the isoclines and the 
associated slopes on the integral curves. 

In this case the integral curves are obviously a family of circles of 
equations x? + y? = οἷ, 

In Example (ii) the differential equation can be integrated by the 
method of section 11.1.2 to give 


[ra -βμὰ 


ie. dy? = —4x? + constant 
which is of the form x? + y? = c?, 


Example (iii) Draw the isoclines for the differential equation 
dy/dx = x + y?. Construct the portion of the integral curves, for 
x > 0, passing through (a) the point (0, 0), (b) the point (1, 1). 
Solution The equations of the isoclines are given by 

x+y?=m 


where m is the slope of the integral curves where they cross the isoclines. 


Fig. 11.7 
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The equation . 
y=m—x 


is a parabola whose axis is the x-axis and whose vertex is (m, 0). The 
family of isoclines is shown in Fig. 11.7. The Fig. 11.7 also shows the 
line elements, i.e. the portions of the integral curves where they cross 
the isoclines. The integral curve through (0, 0) has the shape of curve 
(a) and the integral curve through (1, 1) has the shape of curve (b). 
Clearly, if isoclines are drawn at closer intervals it should be possible 
to get a better approximation to the shapes of the integral curves about 
the two points chosen. The Fig. 11.7 also indicates that for x > 1 the 
solution y increases rapidly with increasing x. 


Example (iv) Sketch the isoclines and solution curves of 


dy 1 
x ἘΞ a 


Solution The isoclines are given by the equation 
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Comparison with YY = 1 shows that y(y + mx) = 1 is an hyperbola 
with asymptotes y=0, and y = — mx. Re-writing in the form 
1 — y? = mxy shows that the branch of the hyperbola through 
" (0, 1) is the one required. The isoclines and associated integral curves 
are shown in Fig. 11.8. 


EXERCISE 11 
1 Solve the following differential equations: 


. ἂν 
(i) 5, = 9; 


: d 
(ii) PQ? + Ὁ + yvoP + 1) SF = 0; 


ὦ 
(iv) VOxy) > = 1; 


+ dx 2 . 
(v) Se + cosh? y = 0; 


: ὦ 
(vi) γνῶχΞ + 3) ~ + xv(4— y?) =0. 
2 Solve the differential equations 


ὦ a +eysinx το; 
7 dy — > 
(ii) y FP = αν; 


d 
(iii) °(2 + i) = 1, given that y = 1 when x = 0; 


d 
(iv) ( — χὴν = = 4 — γ᾽, and show that the particular curve of 


the system which passes through the origin has equation 
y? = 4x(2 — x). 


3 Solve the differential equations 
ὦ wt DD 4 wa 6. Ὁ 1)?, y = 0 when x = 1; 


Gi) 2 + ay tanx =3; (it) xtogx 2 4 y = x log; 


: dx dx dy ’ 
(iv) Ee ay ap (v) cos x F + y sin x = cos 2x, 
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4 (Ὁ If dy/dx + 2y tan x = sin x and y = 0 when x = 0, show that 
y = —2 when x = =. 
(ii) Solve the differential equation 


dy Η 
ae + ycotx = χ', 
(iii) Solve the differential equation 
of 3 
ΩΣ ὅσος 


5 Use the substitution y = ux to solve 


d ace 
(i) x(x + y) Ξ =y(x—y)s ὃ 3x? = = x7 + 2y?; 


Ἢ d ; d 
(ii) "Ὁ = yx + y); (iv) x? + y? Ἐν Z = 0; 


d 
(v) (x? — 29) F + y? = ay. 


6 Show that the following differential equations are exact and find 
their solutions; 


(i) & + y) dx + (x + y?) dy = 0; 
(ii) (2xe” + e*) dx + (x? + le’ dy = 0; 
(iii) (2xy + y?) dx + (x? + 2xy — y) dy = 0. 
7 Solve the differential equations 
dy 
ΣΕ (2) ae’ 
given that when x = 0, y = 1 and dy/dx = 0; 
ἣν d? ὦ 
(Ὁ α -- x9 Ὡ2 - x ξξ -- 2; 


dy 

2 

(iii) ἜΦΥ (2) ΞΞΙῚ; 
where k is a given constant. 


8 A particle moves in a straight line, its distance x from a fixed point 
of the line at time ¢ is given by the equation 


dx k dx\? _ 

de + Κι) = 4% 
where k and yw are positive constants. If initially the particle is at 
rest at the point x) = 1/(2k), obtain an expression for the velocity, 


and show that the particle moves with constant acceleration 
#/(2k) in the direction of increasing x. 
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9 Obtain the complete primitive for each of the following: 


(i) (D? — 3D — 10)y = 0; (ὃ (D? + 3D)y = 0; 
(iii) (D? + 4D + dy =0; (iv) (D? — 4D + 13)y = 0; 
(v) (Dt + 4m*)y = 0, where m is constant. 


10 Find the most general solution of the following equations such that 
(a) y = 0 when x = 0; (Ὁ) y = 1, dy/dx = 0 when x = 0. 
ὦ B44 Zs ay =o; ὦ Z-2% 4 yao. 

11 Determine particular integrals for the equations 
(i) (D? + 4D + 3)y = x?; (ii) (D? + δὴν = 3x + 7. 

12 Obtain the complete primitives of the following equations 


(i) (D2 + 2D + 5)y = 4 + 2x; (ii) (D? — 4D + Dy = x?; 
(iii) (5D? — 4D — τὴν = 2; (iv) (D* + 3D®)y = x? 


13 Determine particular integrals for the following equations: 


(i) (D? + 4D + 3)y = εἰ; (ii) (D? + 4D + 3)y = 3e7*; 
(iii) (D? — 2D + τὴν = e*; (iv) (D? + 5D + δὴν = χε"; 
(v) (D? + 4)y = xe"; (vi) (3D? — 10D + 3)y = 2e*. 
14 Solve the following differential equations: 
d 
@ 4 ne aan 2: os eer Sy = (4 + 2x)e*; 
d*y ᾿ — x2%e22 
(ii) ax? + 2 ae xe 
eh 5. Sd igs ὦ, 
(iii) Be eget a - χε. 
15 Determine particular integrals for the differential equations: 
(i) (D? + 3D)y = S cos x; (ii) (D? + 9)y = 2 cos 3x; 


(iii) (D? — 2D + l)y = e* sin x; (iv) (D? — 2D + 2)y = e* sin x; 
(v) (Dt + 2D? + τὴν = sin x. 
16 Solve the following differential equations 
(i) (D? + 9)y = 40s 2x; (ii) (Ὁ + dy = cos x; 
(iii) (D? + 5D + 6)y = cos x + sin 2x. 


: d*x dx 
17 (Ὁ Solve ae 4 at 
t=0. 


δ d*x 
Gi) Solve Fe 
t=0. 


+ 4x = 0, given that x = 0, dx/dt = 2 when 


+x=3sin2t, given that x = dx/dt=0 when 
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18 (ὃ) Solve the differential a 
13 yy 62 ~ + Sy =e" 


given that when x = 0, y = 0 and dy/dx = —1. 


(ii) Find the most general solution of the differential equation 
(D? — 3D — 4)?y = 0, such that dy/dx = 0 when x = 0 and the 
ratio y/x tends to zero as x tends to zero. 


19 Solve the equations 
(i) (Ὁ — 1)?y = xe’, given that y = 1, Dy = ἐ when x = 0; 
οι a7y dy : 
(ii) meat 47: + 3y = 8e*, given that when x =0, y =0 and 
dyldx = 4. 


20 Solve the differential equations 


iy a: ee 32 > + 2y = 4e7*, given that y and dy/dx are zero when 
x is ae 
Gi) 1Ξ ane 21: ad π τ 10y = sin 3x, given that y = 0, dy/dx = 1 when 
x=0. 

21 Solve the differential equations 


(i) (D? — 2D + ly = χ; (ii) (D? — 10D + 29)y = ο΄ cos 3x; 
(iii) 9D? +6D+ Dy =0; (iv) (D?+2D4+ )y=x41; 
(v) (D? + 3D + 2)y Ξε ο; (vi) (D? + 2D + 5)?y = 0. 


22 ae the differential equations 


@ 4 2 cae 32 — 4y = 20s 2x; 
5 
«iy 4 = = + 3y; 


q? 
(iii) Ἢ Ψ 45} = 25ἰπ2χ; 


ἐν τ 2g 
Ξ - 2y = xe; 


(iv) κὰν 


d 
ee -- 4 ΔΑΝ Sy = x(e* sin x + x). 
7 dx 
23 Sketch the isoclines of the differential equation 
dy|dx = (x — y)/x 


for x > 0. Use these to sketch the solution curves for x > 0. 
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24 Use the method of section 11.1.3 to show that the general solution 
of the differential equation dy/dx = 1 + (y/x) is y = x log Ax where 
A is a constant. Sketch the isoclines of the differential equation, show 
the solution curve passing through (1,0), and compare this with the 
actual solution curve through that point. 


25 Sketch the isoclines of the differential equation 
dyldx = (x? + y%)Ix 
and use them to sketch the solution curve near to the point (2, 0). 


12 
Analytical Properties of Algebraic 
Equations 


12.0 INTRODUCTION 


This chapter is concerned with some of the simpler analytical properties 
of algebraic equations particularly those of polynomial type. These 
properties are of use in subsequent chapters. 


12.1 THE REMAINDER THEOREM 


This states that if f(x) is a function possessing a Taylor series at the 
point x = a then the remainder on dividing by (x — a) is f(a). 


Proof 
The Taylor series for f(x) is 
fo) =f/@+ α —af'a + (x —a?f"(@2+... 
= f(a) + ὦ — ὠφο) 


where 


φ0) = (ὦ +(e —af"@2i+... 
Hence SOME — α = Hx) + @/& — ὦ 
in which ¢(x) is called the quotient and f(a) is called the remainder. 


In particular if f(x) is a polynomial of degree n then ¢(x) is a poly- 
nomial of degree γε — 1. 


Example (i) If x* — 7x? + x — 215 divided by (x + 2) the remainder is 
(—2)* — 7(—2)8 + (—2) — 2 = 68. 


Example (ii) If x sin2x is divided by (x — 7/4) the remainder is 
(/4)(sin 27/2) = 7/4. 
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12.1.1 The factor theorem 


This states that if (x — a) is a factor of f(x) then f(a) is zero and is an 
immediate consequence of the remainder theorem. 


Example (i) Let f(x) be x? + 2x? — 5x — 6, then by trial 
f2Q)=8+8—10-6=0 
and (x — 2) is a factor of f(x). By inspection 
f(x) = α — ἰ 2" + 4x + 3) 
= (x — 2 + I + 3). 


Example (ii) Let f(x) be 1 — sin x then by trial (7/2) =0 and 
(x — 7/2) is a factor of 1 — sin x. But 


J(x) = 1 — sinx 
= fla[2) + (x -- m/f Cr) + (x — 7D "A/D". « 
=0+0+4+ (* -- π|2)2}|2. - - ... 


which shows that (x — π|2) is a double factor of 1 — βίῃ x. 


12.2 SOLUTION OF EQUATIONS. MULTIPLE ROOTS 


It follows immediately from Taylor series and section 12.1 that the 
equation f(x) = Ὁ has a root at x =a if f(a) = 0. This is a simple 
root if f(a) τ 0; a double root if f(a) = 0, f’(a) #0; a triple root if 
I'(@ =f"@ = 0, f"(a) # 0; and so on. The condition that f(x) = 0 
has n equal roots x = ais that f(a) = f‘@) =f") =...=f"-a) 
= 0, f(@ 40. 

If f(x) is a polynomial of order » then the equation f(x) = 0 can 
have at most n roots x = x1, X = X2, Χ = X3,.. .. X = X,_ because 
successive division of f(x) by the factors (x — x), (x — x2), ... 
(x — x,) gives a final quotient which is constant. On the other hand a 
general equation f(x) = 0 where f(x) has a Taylor expansion can have 
an infinity of roots. For example, the equation sin x = 0 has an infinity 
of roots given by x = ra, where r is an integer. 


12.3 DETACHED COEFFICIENTS. SYNTHETIC 
DIVISION 


It is often necessary to divide a polynomial by a linear expression 
(x — «) in order to examine the quotient and remainder. There are 
two ways in which the familiar long division process may be displayed. 
The first is by using detached coefficients in the long division process 
itself and the other is a condensed version of this display called synthetic 
division. To illustrate the methods consider the division of 3x* — 7x? + 
4x? — 3x — 5 by (x — 2). 
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12.3.1 Division using detached coefficients 
The reader should have no difficulty in following the long division 
process employed. 


1 -2) 3 - +4 -3 -- 6 —1 42 +1 


3 —6 
-1 44 
=e) 
+2 --3 
+2 —4 
+1 —5 
ἘΠ =o 


The quotient is 3x* — x? + 2x +1 and the remainder is —3. This 
process starts by dividing 3x* by x to give the first coefficient 3 of the 
quotient, this coefficient is then multiplied by the coefficient —2 of the 
divisor and subtracted from the next coefficient —7 of the dividend to 
give the term —1 of the third row. This process is repeated until no 
further division is possible. 


12.3.2 Synthetic division 

The process of section 12.3.1 may be conveniently contracted by 
omitting the irrelevant detail and changing the sign of the coefficient 
—2 of the divisor so as to make additions instead of subtractions. 
The process is displayed below: 


3 -7.4..3 eh 
42 +6 —2 +4 +2 


3 —1 +2 414-3 
The procedure is as follows: 


(i) write the dividend in detached coefficient form, any absent 
terms being shown by a zero coefficient, 

(ii) show the multiplier +2 (associated with the divisor, x — 2), 

(iii) carry the first coefficient of the dividend, i.e. 3, into the last row, 

(iv) multiply the first coefficient of the last row by +2 and place the 
product +6 in the second place of the second row, 

(v) add the two terms of the second column to give —1 as the second 
term of the last row, 

(vi) multiply the second coefficient of the last row by the multiplier 
+2 and add to the third coefficient of the dividend, 
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(vii) continue the process, 
(viii) the last term of the last row is the remainder and the preceding 
terms form the coefficients of the quotient. 


Example Use synthetic division to obtain the quotient and remainder 
when 3x° + 9x* — x? + 11x is divided by (x + 3). 
~ Solution 3 +9 +0 - ἘΠ +0 
—3 -9 O O +3 —42 
3 Ὁ 0 -1 +14} —42 


The quotient is 3x* — x + 14 and the remainder is —42. Note the 
zero coefficients in the dividend corresponding to the coefficients of 
x8 and constant term. 


12.3.3 Division of a polynomial by (ax — 5) 
Let f(x) be a polynomial of degree n then 


cs {3a 


αχ --ὃ alx — bla 
b 
“alee γι | 
ΜΝ sl) 


where ¢(x) is a polynomial of degree n — 1. 
The process is then seen to be as follows: 


(i) divide f(x) by the expression (x — b/a), 
(ii) the coefficients of the quotient obtained are a times too large but 
the remainder term is correct. 

Example (i) Divide 21x* — 7x° + 3x? — 10x — 1 by (3x — 1). 
Solution 21 —7 13 ~—10 --ἰ 

3 +7 O +1 —3 

21. 0 ΕΒ’ 29,4 

The quotient is 7x® + x — 3 and the remainder is —4, 
Example (ii) Divide 16x* + 24χ — 3x — 2 by (2x + 3). 


Solution 16 +24 +0 -—3 —2 
ΕἸ —24 0 0 +3 


16 0 0 --]-ὖ 


The quotient is 8x* — 3 and the remainder is +. 
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12.3.4 Repeated division 


Repeated division of a polynomial by a linear expression may be 
easily displayed by the synthetic division process. 


Example Divide 6χϑ + 80x? — 1 by (x + 2)°. 


Solution 6 0 0 +80 0 —1 
—2 —12 +24 —-48 —64 +4128 


6 —-12 +24 +32 —64) +127 
—2 —12 +48 —144 +4224 


6 —24 +72 —112} +160 
=9 —12 +72 —288 


6 —36 +144 | —400 


Hence 
6x5 + 80x? — 1 ; 400 160 
ce) πὰ Ἢ ἢ = 6x? — 36x + 144 — Gas +2) + G+)? + 2)3 
127 
+ (x ze 2)> 


It may be noted that these terms are the partial fractions relevant to 
(x + 2)%. 

Continued division may be used to express the given polynomial as 
a polynomial in the linear divisor. For example, continuing the example 
above 


6 —36 +144 
τ —12 +96 

6 —48| +240 
a —12 

6 | —60 


from which it follows that 


6x° + 80x? — 1 = 6(x + 2)5 — 60(x + 2)" + 240(x + 2)? 
—400(x + 2)? + 160(% + 2) + 127. 


From the Taylor series it is evident that the coefficients +127, +160, 
—400, +240, —60, and +6 are the successive terms f(—2), f’(—2), 
f"(—2), f’"'(—2), f(—2) and f©(—2) respectively. 
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12.3.5 Division of a polynomial by a quadratic expression 

Synthetic division may be extended to include the division of a poly- 
nomial of degree greater than two by a quadratic expression ax? + 
bx +c. Consider the long division of 3x* + 5x3 — 15x? + 10x — 1 
by x? + 2x — 3. In detached coefficient form this is 


1+2—3)3 +5 -15 +10 πὶ -1 --4 


3 +6 —9 
-| -6 +10 
-1 ~2 +43 
-ά +7 - 
-4 —8 412 
+15 --Ι3 


and the quotient is 3x? — x —4 the remainder being (15x — 13). 
As in section 12.3.2 this process may be contracted by omitting un- 
necessary repeated coefficients and changing the signs of the coeffi- 
cients +2 and —3 in the divisor so that any subtraction is replaced 
by addition. The display is similar to that of synthetic division by a 
linear expression but an additional row of coefficients is required. The 
first row of coefficients is obtained from the dividend. The second row 
is obtained from the products of the multiplier —2 with the coefficients 
of the quotient and the third row consists of the products of the multi- 
plier +3 and the coefficients of the quotient. The last row consists of 
the coefficients of the quotient and remainder. The condensed array 
is shown below 


+10 -1 


Ι 
Ι 
--} =6 $2 1 -8 
Ι 
| 


and the procedure is as follows: 


(i) carry the first coefficient of the dividend into the first place of the 
last row, 

(ii) multiply this coefficient by the multiplier —2 and enter the product 
in the second place of the second row. Multiply the same coefficient 
by the second multiplier +3 and enter the product in the third 
place of the third row, 
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(iii) add the elements of the second column and enter in the second 
place of the last row, 

(iv) repeat the process of (ii) and (iii) entering the products into the 
third and fourth places of the second and third rows respectively, 

(v) repeat the process until the last coefficient of the third row is 
found. Add the coefficients of the last two columns, 

(vi) the coefficients of the linear remainder are the last two coefficients 
of the last row. The coefficients of the quotient are the remaining 
ones of the last row. 


The procedure may easily be adapted to include a divisor with a 
coefficient of x? other than unity. As in section 12.3.3, the scheme is 
written for a divisor x? + (b/a)x + (c/a), the remainder will then have 
the correct coefficients but coefficients of the quotient will be a times 
too large. 


Example Divide 3x* + 2x? — x + 5 by 3x? — x 4 2. 
Solution The scheme becomes 
3 +2 -Ἱ +5 


and the quotient is (x + 1), the remainder (—2x + 3). 


12.4 RELATIONSHIPS BETWEEN THE ROOTS OF A 
POLYNOMIAL EQUATION 


Consider the roots of a cubic equation 
εχ + aox? + ax + ay = 0. 
Let the three roots be «1, a, «3 then 
A3X° + Ax? + a,x + ἀρ = a3(X — αι) (X — Hx — a3) 


and comparison of like coefficients leads to the following relationships 


αι + αὐ + a3 = —az/az 
ἀρῶ + Hye, + He = a,/ag (1) 
ατιαοῦς = - ἀρίας 


The left sides of equations (1) are the sums of the products of the roots 
taken one at a time, two at a time, and three at a time respectively. 
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Example (i) If «1, %2, %3 are the roots of 2χϑ + 3x2 — 17x + 12 =0, 
then 


αι + Ko + αβ = —3/2 
Mtg + Mgt, + αιὰ = —17/2 
αι.αςοᾶς = —6. 


Extension of this result to the n roots of an mth degree polynomial 
equation 


AnX” + Any X"" 1 + Ay_gx™ 2? +... 4+ ax+4+a,=0 
leads to the relationships 


La, = —a,-3/a, 
Σαια; = +4y_2/a, ΠΣ] 
Σα,α,ας ΞΞ --α,.- οἰα.. ifj x k 


(2) 


%1%y%q... ἄρ = (—1)"ap/a, 


in which the left sides are the sums of the products of the roots taken 
one at a time, two at a time, three at a time, up to 7 at a time. 

It should be noted that the relationships do not in themselves solve 
the original equations. For example, elimination of ας and a in 
equations (1) simply leads to 


43%,° + aga,? + aya, + ay = 0. 
However, if some additional information about the roots is known, 
then these relationships may help to solve the equation. 
Example (ii) Find the roots of 8x3 + 18x? — 17x + 3 = 0 given that 
one root is double one of the others. 
Solution Suppose the roots are «, 2x, and β then, 
3a + 6 = —18/8 (i) 
2a? + 3αβ = —17/8 (ii) 
2078 = — 3/8 (iii). 
Eliminating β between (i) and (ii) gives 
56a? + 54a —17=0 
which leads to 
a = 1/4 ora = --1714. 


From (i) β = —3 or 6 = 39/28 and of the two possibilities « = 4, 
B = —3 satisfies (iii) but α = —17/14, β = 39/28 does not. 
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12.4.1 Diminishing the roots of an equation 
The polynomial equation 


A,X" + ἀρχὴ 1 4+...4+4 =0 
and the polynomial equation 
a,x — k)* + δ,.χἰα —~K* 1 4+...4+5,=0 


where the δι are obtained, as in section 12.3.4, by dividing the first 
equation n times by (x — k), where k is a constant, have the same 


TOOTS G1, Hg, X&y,. . ., Ly. The equation 
any” + by-ry""* + ba-oy™ 2+... +5) =0 
therefore has roots («, — k), (ας — k), . . ., (ας, — k). This process of 


forming a new equation whose roots are each k less than the original 
is called diminishing the roots by k. Similarly the roots can be increased 
by k. 


Example Diminish the roots of x* — 6x? + 9x — 4 = 0 by 2. 
Solution By section 12.3.4 


1 -6 +49 -4 
42 2 τὸ {2 
:..-ὰὁ ἀπ} ee) 
42 2 -4 
Ι -᾿" - 
+2 


2 --- -- - - -- 
ε 1 | 0 


and hence the roots of y? — 3y — 2 = 0 are each 2 less than those of 
the given equation. 


12.55 LOCATION OF THE ROOTS OF A POLYNOMIAL 
EQUATION WITH REAL COEFFICIENTS 


The process of root location is usually a trial and error one but there 
are a number of properties that may be used as aids to location and 
classification of roots. 

(a) If x2 > x; and f(x), f(x) are of opposite signs then there is at 
least one root of f(x) = Ο in the range x, < x < x2. This is obvious 
geometrically. 


Example (i) If f(x) =x?+2x+1, then, since f(—1)<0 and 
(0) > 0, there is at least one root between x = —1 and x = 0. 

(b) If « and f are successive real zeros of a polynomial f(x) then 
J'(x) has an odd number of real zeros in the interval a < x < β. 
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This property follows from Rolle’s theorem of section 5.1.1 because 
f(a) = 0 = f(6), f(x) is continuous and f’(x) exists. Stated differently 
the real roots of f(x) = 0 are separated by the real roots of f’(x) = 0. 


Example (ii) If f(x) = χϑ — 7x + 2, then f(x) = 3x? — 7 and the latter 
has two roots x = ++/(7/3). Since Sri a 20) <0, f(—+/(7/3)) > 0, 


S(+ V(7/3)) < 0 and Les a f(x) >0 there are three changes of sign for 


JS(x) and it follows eon’ (a) that f(x) = 0 has three real roots. One of 
these roots is in the range —oo < x < —¥4/(7/3), one in the range 
- κ( 3) < x < ++/(7/3), and one in the range +/(7/3) < x < +00. 


It may be stated that if f’(x) = 0 has (7 — 1) unequal roots «,, a, 
%3,. . +, ἄμ in ascending order and if the signs of the terms of the 
series f(— 00), f(a), f(%2), f(%3), - . ., f(%a-2), f(+ ©) alternate then 
J (x) = 0 has 7 unequal real roots. 

(c) Since complex roots occur in conjugate pairs (see section 10.5.3) 
then a polynomial of even degree has an even number (including zero) 
of real roots and a polynomial of odd degree has an odd number of 
real roots (i.e. at least one real root). 

(d) The Descartes’ rule of signs states that if f(x) is a polynomial 
written in descending powers of x then 

(i) the number of Positive real roots of f(x) = 0 is not larger than the 
number of variations of f(x) and, 

(ii) the number of negative real roots is not larger than the number of 
variations of f(—x). 

In applying the rule, f(x) is said to have a variation if two successive 

terms have opposite signs. 


Example (iii) Show that x° — 11x° + 2x? + 12 = 0 has two complex 
roots, two positive real roots, and one negative real root. 


Solution Since f(—c) <0, f() = 12>0, (2) = —36<0, and 
J(+ 0) > 0 then, by (a), there must be at least one negative teal root 
and at least two positive real roots. Descartes’ rule of signs (d) states 
that f(x) = 0 has not more than two positive real roots and not more 
than one negative real root because f(x) has two variations and f(—x) 
has one variation. 

There are therefore exactly two positive real roots and one negative 
real root and hence two complex roots. 


(e) A rough graph is helpful in the location of the roots of an equa~ 
tion f(x) = 0. For example, a sketch of the curve y = x? — 3x — 1 
from x = —2 to x = +2 will show that there is one root between 
—2 and —1, one root between —1 and 0, and one root between 1 
and 2. 

Alternatively a sketch of the two graphs of y = x° and y = 1 + 3x 
will show the three intersection points. 
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12.5.1 Procedure after location of a root 


If a root x = « of a polynomial equation is known or can be exactly 
located then the polynomial may be divided by (x — «) to give a 
quotient and a zero remainder (the evaluation of the latter serves as 
a check on the working). An attempt may then be made to locate 
the zeros of the quotient and hence the further roots of the original 
equation. 

It is likely that the roots of a polynomial cannot be found exactly 
and in this case an approximate value must be obtained. Methods of 
approximate root location will be discussed in the next chapter. 


EXERCISE 12 

1 Divide 3x® — x° + 4χϑ — 1 by (i) x — 2, and (ii) 3x — 1. 

2 Factorise the function 4x* — 6x + 4x? — 3x + 1. 

3 Express 
(3x7 ~ 36χϑ + 162x5 — 324x* 4+ 244x3 — 13x? + 51x — 58)/(x ~ 3) 
as the sum of a quotient and partial fractions. 

4 Find the quotient and remainder when 4χϑ — 7x* + 24x3 — 18x? + 13 
is divided (i) by x? — 2x + 3 and (ii) by 2x? + 3x — 1. 

5 Using the relationships between the roots find the equation whose 
roots are the squares of those of 3x° + 2x —1=0. 


6 Solve the equation 12x* — 20x? + x + 3 = 0 given that the sum 
of two of the roots is 2. 


7 Show that 6y? + 19y? + lly — 11 =0 is the equation obtained 
by diminishing by three the roots of 6x* — 35x? + 59x — 35 = 0. 
Show that there is a root y = 4 and hence solve completely the 
equation in x. 


8 Show that f(x) = 6x* — 7χϑ — x — 6 = 0 has one negative real 
root, one positive real root, and two complex roots. 


9 An approximate root of x*+x?—7x+1=0 is x =2:10 
correct to two decimal places. Divide the polynomial by (x — 2:10) 
and find approximations to the other two roots. 


10 Show that x* — 3x —1=0 has three real roots, one between 
x = —2 and x = —1, one between x = —1 and x = 0, and one 
between x = 1 and x = 2. Show that the substitution x = rcos 0 
and the identity cos 30 = 4 cos* θ — 3 cos 9 transforms the given 
equation to cos 30 = } provided that r = 2. Solve this equation for 
6 and hence find all the roots of the given equation. 

11 Show that the method of Question 10 applied to the equation 
x? — 3x — 4 =0 leads to roots x = 2 cos θ where θ is a solution 
of cos 30 = 2. Solve for @ using the method of Question 14, Exer- 
cise 10 and hence find all the roots of the given equation. 


13 
Numerical Methods | 


13.0 INTRODUCTION 


The roots of an equation f(x) = 0, or the solution of a system of 
equations, may be found by either graphical or numerical processes. 
The former may be used if accuracy is not important, there being a 
limit to the accuracy with which a graph can be drawn and read. 
The numerical processes start with an approximation to the solution, 
often obtained from a rough graph, and then use a systematic scheme 
of numerical computation to improve the value. Such a process stops 
when the value is found to the required degree of accuracy. Any 
numerical process should, as far as possible, include some form of 
checking procedure so as to eliminate the propagation of errors. 
Among the numerical methods there is the iterative method. This 
starts with an approximation to the required value and by a given 
form of calculation produces a better approximation, which is then 
improved by the same form of calculation, the process terminating 
when two successive iterations give identical value to the degree of 
accuracy demanded (e.g. to the same number of decimal places). This 
method is the basis of most high speed computer programs, the 
computer performing a large number of iterations in a short space of 
time. It is possible for an iterative process to lead away from the 
required root, that is the process could diverge instead of converge. 
Under certain conditions the process may diverge from the required 
root but converge to some unwanted root. Usually it is necessary to 
ensure that certain criteria are satisfied by the equation or system 
of equations before commencing an iterative process. If these criteria 
are not satisfied then it may be necessary to employ an alternative 
process. 


13.1 GRAPHICAL METHODS 


When accuracy is not important, a root of the equation f(x) = 0 may 
be found by plotting the graph of the function y = f(x) over some 
range of values a <x <b for which f(a) and f(b) are of opposite 
signs. The root is then found by reading the abscissa of the crossing 
point with the x-axis. This is a process that needs no elaboration. 


13.1.1 Intersection of two graphs 


Instead of drawing the graph of y = f(x) and finding the intersections 
with the x-axis it is often an advantage to write f(x) in the form 
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d(x) — y(x) and then draw the graphs of the two functions y = ¢(x) and 
y = y(x). The abscissae of the intersection points give the required 
roots of f(x) = 0. 


Example The roots of the equation 1 + x? —3tanx =0 are ob- 
tained from the intersections of y = 1 + x? and y = 3 tan x. These 
curves are easily drawn to any reasonable accuracy and the abscissae 
of the intersections points obtained. The curves are indicated in 
Fig. 13.1 and from this rough sketch the smallest root is found to be 
approximately 0-4. 


Fig. 13.1 


13.1.2 Repeated plotting on a larger scale 


An approximation x = χρ to a root of f(x) = 0 may be improved by 
drawing the part of the graph y = f(x), or of the parts of the graphs 
y = d(x) and y = (x), near to x = x, to a larger scale and locating 
the root with additional accuracy. Instead of drawing the curves 
accurately it is usual to replace them by straight lines drawn through 
the end points of the interval about x = x. This introduces an error 
into the new estimate of the root so that the process is repeated until a 
sufficiently accurate result is obtained. 
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Example Continuing the example of section 13.1.1 the following 
table may be constructed 


x &x)=1+x? y(x) =3tanx 


0-3 1-09 0-93 
0-4 1:16 1:27 


from which it is seen that the curves of y = ¢(x) and y = (x) intersect 
between x = 0:3 and x = 0-4. Replacing the curves by straight lines 
through the end points of each curve as in Fig. 13.2 gives the improved 
value of the root as 0:36. 


Fig. 13.2 


An estimate of the error involved is found by calculating f(0-36). 
This is 0-0004, and the result 0-36 may be accurate enough to stop the 
process at this stage. Improved accuracy may be obtained by drawing 
the straight lines through the end points corresponding to x = 0-360 
and x = 0-361. This leads to the estimate x = 0-3601. 

At this stage it may be claimed that the result x = 0-360 is correct 
to three decimal places. 


13.2 NUMERICAL METHODS OF SOLVING f(x) = 0 

The graphical solution of section 13.1.2 can be represented numerically 
as an iterative process known as the rule of false position and this is 
discussed in section 13.2.1. In addition two other methods of iteration 
are considered. One is discussed in section 13.2.2 and the other, a 


simple but also a very powerful process, known as Newton’s method, 
is described in section 13.2.3. 


13.2.1 Rule of false position or method of linear interpolation 


Suppose a root of the equation f(x) = (x) — y(x) = 0 has been 
located between the values x, and x, + a (the smaller the value of a 
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the better), then reference to Fig. 13.3 shows that a better approxima- 
tion to the root will be given by x = x, = x» + h where, using similar 
triangles, 


h = (Xo) — $(%o) 4 
α---ἰ [φ( ὁ + ὦ — φάο + ὦ =O) 
h_ A 
τὰ a A+B 


Fig. 13.3 


The second approximation to the root is 


Χι = ΧΟ Ἔ (τ 2). (1) 


It may be noted that'if y(x) = 0 then (x) = f(x) and A, B are simply 
| f(xo)| and | f(xo + a)| respectively. 


Example (i) Calculate the root of 1 + x? — 3 tanx = 0 that lies in 
the rangeO <x <1. 


Solution Let ¢(x) = 1+ x? and y(x) = 3tanx, then it is easily 
seen that there is a root between 0 and 1. Suppose the first approxima- 
tion to be x, = 0, then the following table may be constructed and the 
next approximation found by an application of equation (1): 


x $(x) w(x) φ-- 
0 1 0 1 
1 2 4-67 —2-67 


With a = 1, A = 1, B = 2-67, then h = (1/3-67) x 1 
= 0:27 
= 0.3. 
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The second approximation is then 0-3. Proceeding as before: 


x A(x) yx) φ--ν 
0-3 1-090 0-928 +0:162 
0-4 1-160 1-268 —0-108 


and here, a = 0-1, A = 0-162, B = 0-108, from which 
h = (162/270) x 0-1 = 006. 
The third approximation is 0-36, then 


x #(x) (x) φ-- 
0-360 1-129 60 1:12921  +0-000 39 
0-361 1-130 32 1:13264 —0-002 32 


and here, a = 0-001, A = (000 39, B = 0-002 32, and hence 
h = (39/271) x 0-001 = 0-000 1. 
The fourth approximation is 0-360 1. 
Note I The difference ¢ — y not only provides the values of 4 and B 


but the change in sign of the difference shows that there is a root 
between the two values of x used in the calculation. 


Note 2 With this method it is necessary to retain more decimal places 
in each successive calculation, five figure tables were used in the 
last approximation. 

Example (ii) Find the real root of 

S(x) = χϑ — 6x2 + 12x — 24 =0 
to two decimal places. 
Solution Since f(4) = —8 and f(5) = +11, there is a root between 


x = 4 and x = 5. Diminish the roots by 4 using the results of section 
12.4.1: 


Ι πὸ -12 --24 

4 +4 -8 +16 

tL τ 44 8] 
+4 48 


1 +6 
then F(z) = 2° + 6z? + 122 — 8 = 0 has a root between 0 and 1. 
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Now successively apply (1) to F(z) starting from the value z) = 0. 
Then 


Ζ F(z) = ζϑ + 623 + 122 -- 8 


0 —8 
1 +11 


and here, a = 1, A = 8, B = 11, hence 
ἢ = (8/19) x 1 = 0-4. 


The second approximation is z, = 0-4, then 


Ζ F(z) 
0-4 —2:176 
0-6 +1-576 


and here, a = 0:2, A = 2-176, B = 1-576, hence 
h = (2176/3752) x 0-2 = 0-12. 
The third approximation is Ζο = 0:52, then 


Ζ F(z) 
0-52 +0:003 0 
0°51 —0-186 7 


and here a = 0-01, A = 0-186 7, B = 0-003 0, and hence 
h = (1867/1897) x 0-01 = 0-009 8. 


(Note that the sign of F(z) when z = 0:52 shows that the third approxi- 
mation has overshot the root.) 


The fourth approximation is z, = 0-519(8). At this stage it may be 
claimed that the root is z = 0-52 correct to two decimal places. The 
root of f(x) = 0 is then 4-52. 

In practice this method is seldom used after the second stage. A 
value having been found the approximation is improved by using a 
more powerful method such as the Newton-Raphson method of 
section 13.2.3. 


13.2.2 Another iterative method 


There is a useful iterative method available when f(x) = 0 can be 
written in the form x = ¢(x). The iterative procedure is given by 


Xn+1 = (Xn) 


where x, is the mth iterate and the procedure ends when the required 
accuracy has been obtained in two successive iterations. 
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The figures 13.4(a) and (Ὁ) illustrate two convergent iterative processes 
Xo, Χι, X2, X3 . . ., One Converging directly to the root and the other 
oscillating about the root value. The Fig. 13.4(c) illustrates a process 
that diverges from the root value. It is obviously possible with this 
process to start with an approximation very close to the required 
value and then to calculate values that are further and further away 
from the true value. It is a necessary and sufficient condition for 
convergence that |f’(x)| <1 in the neighbourhood of the root. This 
criterion is satisfied for the curves of Fig. 13.4(a) and (b) but not for 
Fig. 13.4(c). 


Fig. 13.4 


Example Find the root of the equation 

x? — 2x? + 3x —1=0 
near to 0.5. 
Solution Write 

Xna1 = 41 + 2x2 — x3) 
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and start from x) = 0. The iteration proceeds as follows: 


χα =0 χε = 0-430 

xX, = 03 χε = 0-430 1 
χὰ = 0:38 χη = 0-430 14 
X3 = 0-41 Xg = 0-430 16 
χε = 0-42 Χο = 0-430 160 


The root is therefore 0-430 2 correct to 4 decimal places. This procedure 
does not have rapid convergence but is very simple to apply. 


A major disadvantage of the method is that it may be possible to 
rearrange f(x) = 0 in a number of different ways so as to produce 
the form x = ¢(x) and while one (or more) of these arrangements 
can lead to a convergent process another may lead to a divergent one. 
For the purpose of illustration consider the equation 


x? -—4x¥+3=0 
which has roots at x = | and x = 3. The iteration 
Χχραι ΞΞ 4 -- 3/Xn = $i(x,) 


for which |¢{(1)| > 1 and |47(3)| < 1, will be convergent near the 
root x = 3 but divergent near the root x = 1, i.e. it is impossible to 
locate the latter root using this iteration. However the iteration process 


Xne1 = 3/(4 — x) = pan) 


for which |/(1)| < 1 and |¢3(3)| > 1 will converge to the root x = 1 
and diverge from the root x = 3. 

In employing this method care must be taken to see that the iteration 
is converging to the required root. A rough sketch is frequently an aid 
to the making of the correct choice of function ¢(x). 


13.2.3 The Newton-Raphson method 


Suppose that x» is an approximation to the root of the equation 
J(x) = Ὁ then the Taylor series expansion of f(x) in the neighbourhood 


of x is given by 
fd) = flea) + & τ fe) ἘῸ — P+... ὦ 


If x, is a value of x nearer to the exact root, then (x; — Xo) will be 
small, f(x,) is approximately zero, and providing that f’(x9) is not 
too large the Taylor series may be replaced by 


0 = f(%o) + Gi — Xo)f"(%o) 


Fl). 
FG) 


(3) 


from which Χι ΞΞ Χο - 
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This is the Newton-Raphson rule and repeating for successive approxi- 


mations x1, X2, Xs, . . . leads to the iterative relationship 
70 ὦ 
n+1 n S'(&n) ( ) 


where x, is the nth iterate. The geometrical interpretation is shown in 
Fig. 13.5. 


Ao 


Fig. 13.5 


The ordinate at Ay is f(x) and, since tan 0) = Το), the tangent 
at Ao intersects the x-axis in the point x,. The process illustrated in 
Fig. 13.5 is seen to be convergent. 

As a working rule the number of reliable decimal places in the 
quotient f(x,)/f’(x,) is usually double the number of zeros between 
the decimal point and the first significant figure (1.6. the number of 
correct decimal places is approximately doubled at each iteration). 
The final answer is obtained when two successive iterates agree to the 
requisite number of decimal places. 

It is evident from Fig. 13.5 that the convergence will be rapid provid- 
ing that f’(x) is large near to the root but becomes slower as I’) 
approaches zero. In the latter case it becomes necessary to compute 
I(x,) and f’(x,) to high accuracy to obtain an accurate value of x,41. 

Under certain conditions the convergence may be too slow for hand 
computation or may even diverge. The difficulty is frequently overcome 
by plotting a rough curve to a larger scale near to the root or roots 
and examining the geometrical situation. Different cases of convergence 
are illustrated in Fig. 13.6. 
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In Fig. 13.6(a) the process is convergent, the iterations being succes- 
sively on either side of the root. In Fig. 13.6(b) the process converges 
to the root at R and neither of the roots at P or Q are located, in this 
case a starting approximation much nearer to P (or Q) will lead to a 
convergent process to P (or Q). In Fig. 13.6(c) the process is divergent 
starting from the approximation x9. As in the previous case the root 
can be located by a convergent process starting from an approximation 
nearer to the root itself. 


(c} 
Fig. 13.6 


Example (i) Find the smallest root of 1 + x? — 3 tanx = 0. 
Solution Write 

f(x) =1+ x? —3tanx 
then 70) = 2x — 3 sec? x. 


Take the first approximation to be x, = 0-3 (being the second approxi- 
mation obtained by the rule of false position) then 


f0-3) = 0-16 
03) = —2°58, 
and x, = 0.3 — (0-16/—2:58) 
= 0:3 + 0-06 
= 0:36. 
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With x, = 0-36, 
χα = 0:36 — f(0-36)/f'(0-36) 
= 0:36 — 0-000 4/(—2-705 1) 
‘= 0.36 + 0-000 147 87 
Le. Xq = 0-360 148 = 0-360 15. 
Using five figure accuracy, f(0:360 15) is 0-000 08. The result is prob- 


ably correct to the fifth decimal place but a further iteration would 
be necessary to confirm this. 


Example (ii) Find the root of 
x® — 6x? + 12x —~24=0 
near to x = 4. 


Solution This is example (ii) of section 13.2.1 in which the rule of 
false position was used to show that x = 4-52 correct to two decimal 
places after three iterations. 


Let S(x) = χϑ — 6x? + 12x -- 24 
and 7 (Οὐ = 3x? — 12x + 12 
= 36 — 2)? 


and start with x) = 4, then 
x, = 40 — (—8/12) = 4-6 
χα = 46 — (1-576/20-28) = 4-6 — 00778 = 4-52 (say) 
χα = 4.52 — (0-003 10/19-051 3) 
= 4.52 — 0-000 162 7 = 4-519 84 
X%4 = 4.519 84 — (—0-000 02/19-048 79) 
= 4.519 84 + 0-000 001 049 9 
= 4-519 841 0499 


After four iterations the root is 4-519 84 correct to five decimal places. 


A useful application of the Newton-Raphson rule is to the calcula- 
tion of square roots. If x = 4/a then x is a root of 70) =x? -a=0 
and the rule then states that if x, is the mth iterate 


Xn41 = Xn το SUS (Xn) 
= Xn στ (x3 a a)/2x,, 
Le. χει = HX, + a/x,). (5) 


Thus, if x, is an approximation to the square root of a number a then 
the mean of x, and the quotient a/x, is a better approximation. Each 
iteration is correct usually to twice as many decimal places as the 
preceding iteration. 
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Example (iii) Find +/S. 
Solution Take the first approximation as xp = 2, then 

x, = 42 + 2:5) = 2:25 

χα = 1(2:25 + 2-222 2) = 2-236 1 

χε = $(2:236 1 + 2-236 035 96) = 2:236 067 98 
The result 2-236 1 is correct to four decimal places. 


Note 1 A major advantage of the Newton-Raphson method is that 
any error made is not disastrous providing that the process, starting 
from the incorrect value, remains convergent. 


Note 2 If f’(x)is large the rate of convergence is large and the denomina- 
tor term f’(x) may be conveniently replaced by a suitable constant. 
For instance, in example (ii) above, f’(x) could be replaced by 20, 
the successive iterates being x) = 4, x, = 44, x2, = 4.509, x3 = 
4-519 1. 


Note 3 Ameasure of the accuracy of the iterate x, is given by the value 
of f(x,). This value is called the residual and usually denoted by R. 


Note 4 If the coefficients of the equation f(x) = 0 are known only to a 
certain accuracy then the approximations to the roots of the equation 
cannot be calculated with certainty to a greater accuracy. For 
instance if the coefficients are known to be correct only to two 
decimal places then there is no advantage in calculating the roots 
to many more decimal places. 


Note 5 If (a) a small change in the value of the coefficients of f(x) leads 
to a large change in the value of the roots, or 
(b) small values of the residual R can be obtained even when 
the values of x, are widely different from the true value of the root, 
then the equation f(x) = 0 is said to be ill-conditioned. Equations 
that are ill-conditioned are difficult to deal with by any method of 
numerical analysis. In the Newton-Raphson method it frequently 
means that f(x) and f’(x) are both small and that the calculations in 
each iteration must be carried out with the retention of many more 
decimal places than can be claimed with accuracy in the result of the 
iteration. 


13.3 NUMERICAL SOLUTION OF LINEAR ALGEBRAIC 
SIMULTANEOUS EQUATIONS 

The elimination method of Gauss was used in section 8.6.1 to solve 
a set of linear simultaneous algebraic equations. The numbers given 
there were simple integers and the calculations were performed using 
only mental arithmetic. The matrix scheme can be used even when the 
elements are decimal but it is more convenient to draw up a computa- 
tional scheme and to introduce a system of checking. This scheme will 
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be discussed in section 13.3.1. As in section 13.2 it is possible to form 
an iterative process that will converge to the solution of the set of 
equations and two such methods are discussed in sections 13.3.2 and 
13.3.3. 


13.3.1 Gaussian elimination method 


Given a set of three equations in three unknowns x, y, and z: 


4:12χ — 9-68y + 2-01z = 4-93 
1-88x — 4-62y + 5-50z = 3-11 (6) 
1:10x — 0-96y + 2:722 = 4-02 


the basic process is first to eliminate x and y and form an equation 
in z only, second to use this to form an equation in y only and finally 
to form an equation in x only. In terms of matrix algebra this is equiva- 
lent to transforming the coefficient matrix into an equivalent triangular 
matrix. 

A simple check is introduced, and assuming no numerical mistakes 
are made (or left uncorrected) the errors in the final answer are due 
solely to rounding off errors in the multiplication and division used. 
Finally an estimate of the error may be formed and a correction 
applied. The computational scheme proceeds as follows: 


Equation | x y -Ζ δ σ RD R®) Operation 

6.1) 41 —9-68 2°01 4:93 1:38 0-006 0-000 1 

(6.2) 188 --4:62 5:50 311 5:87 --ΟὝὉΟΙ 0-000 3 

(6.3) 1:10 —0:96 2°72 4:02 6:88 0-000 0-000 4 

(6.4) 1000 —2-350 0-488 1-197 0-335 0-001 5 (1) + 412 

(6.5) - 1.880 4.418 --Ο917 --2.250 —0-630 —0-0028 (4) xX —1-88 

(6.6) 0000 —0-202 4:583 0-860  5:240 —0-0038 6) + Ω) 

(6.7) “1100 2.585 —0-537 ~—1:317 —0:369 —0-0017 (4) Χ —1:10 

(6.8) 0-000 1°625 27183 2-703 6.511 —0-0017 4) τ M 

(6.9) 1-000 1.343 1.663 4007 —0-001 0 (8) + 1.625 

(6.10) 0.202 0-271 0.336 0-809 —0-0002 (9) x 0-202 

(6.11) 0-000 4-854 1-196 6049 —0-0040 (10) + (6) 

(6.12) 1-000 0-246 1246 —0-0008 (11) + 4.854 

(6.13) 1-000 1:333 2:334 +0-0001 (9) -- (2) x 1-343 

(6.14) 1-000 4-210 5212 +0°002 1 (4) + (13) x 2-350 
—~ (12) Χ 0-488 


The first estimate of the solution is given by the δ column of (6.12), 
(6-13), and (6.14). The columns marked R are columns of residuals but 
are not computed until after the first estimate is found. The columns 
x, y, Ζ, and 6 together form the coefficients of the augmented matrix 
of the set of equations. The column marked o is the check column and 
the first three entries in this column are simply the row sums of the 
entries in the previous four columns. The elimination process proceeds 
by performing the operations as indicated in the last column, the aim 
being to eliminate the successive variables until the final coefficient 
matrix is in a triangular form. As an aid to computation it is useful 
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to reduce to unity the coefficient of the variable being eliminated. At 
each stage of the elimination the entry in the o column is computed 
by performing the operations upon the previous entries of that column. 
This entry is then checked against the sum of the other entries of the 
row. For example, in forming row (6.9) the entry o, is given by og + 
1-625, 1.6. 6-511/1-625 or 4-007. This entry is then checked against the 
sum of the other entries in that row, namely 4-006 which indicates 
that there is probably no major numerical error. 
The first estimate of the solution is 


x1 = 4210, γι = 1-333, and z, = 0-246 


with the third decimal place in doubt. The column of residuals R® 
is now computed as follows. The first entry is obtained by inserting 
the estimates x,, y;, and z, into the original equation (6.1) written in 
the form 


4-12x, — 9-68y, + 2012, — 4-93. 


The second and third entries in the R® column are found in like 
manner from the original equations (6.2) and (6.3). The first three 
entries are 0-006, —0-001, and 0-000. Note that if the estimates had 
been exact then the residuals would all have been zero. The remaining 
entries in the R® column are computed by performing the appropriate 
operation on the previous entries. Finally, the entries b of (6.12), 
(6.13), and (6.14) are adjusted by subtracting the corresponding residual 
value. The second estimate of the solution is then 


χα = 4210 — 0-002 1 = 4207 9 
Yq = 1.333 — 0-000 1 = 1-332 9 
Za = 0-246 + 0-000 8 = 0-246 8 


The first three entries in the second column of residual values may now 
be found by inserting these new estimates into the original equations. 
The new residuals are 0-000 1, 0-000 3, and 0-000 4 respectively. The 
rest of the column may then be computed and the third estimates 
found. 

The above method of solution solves the set of equations in one 
computational process and, providing the equations are not ill- 
conditioned and sufficient accuracy is maintained in all multiplications 
and divisions, the first estimate obtained should be very close to the 
correct answer. In contrast to this process it is possible to formulate 
an iterative process starting from a rough estimate of the solution and 
then gradually converging to the correct result by successive calcula- 
tions of the same kind. There are two basic methods in common use, 
both modifications of a method originally due to Gauss. 
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13.3.2 The Gauss—Jacobi method 
Suppose that a set of equations in three unknowns is given in the 
form 
A21X + Agey + 4292 = δὰ (7) 
@giX + Aggy + 552 = bg 
and that the equations are already in the form, or by suitable adjust- 
ment of the augmented matrix may be put into the form, in which the 
diagonal terms of the coefficient matrix are large compared with the 
other terms. Then the equations may be written 


x"t) = (b, — aeoy” - “ea 


4X + Aoy + α152 = | 


YOR = (δὲ = Ag x — 242} α52 

204) = (bg = agyx — as2y)/ag3 
where x, y™, and z are the rth iterates. The procedure starts by 
assuming an estimate x, y, and z, which, in the absence of any 
better estimate may be taken as 0, 0, and 0.. The second estimates may 
then be found from equations (8). The results are set out in table form 
and it is usually soon evident whether the process is convergent or not. 
It is possible for the process to be slowly convergent (as for example 
when the diagonal coefficients are not much larger than the other 
coefficients of the matrix) or even to diverge. 


For purposes of illustration of the method, consider the following 
example. 


(8) 


Example Solve the set of equations 
ὃχ — y+ z—18=0 
2x + Sy — 2z — ἘΠ 
x+ y—3z+ 6=0 
Solution The iterative scheme is applied to the equations 
x= (y—z + 18)/8 | 


(9) 


y = (—2x + 2z + 3)/5 
z=(y+x + 6/3 


XD = O125y) — 01252 4 oa 


(10) 
Then, 
yt) = —0-400x + 0-400z + 0-600 


ZO) = 0-333y 4 0-333x + 2-000 


and the iteration starts with x = y = 2 = 0, Successive iterations 
are performed and the results are given in the table below 


(11) 


Iteration | 0 1 2 3 4 
x 0 2-250 2-075 1-943 9 2011 4}, (12) 
y 0 0-600 0-500 0-949 6 0-965 7 
Ζ 0 2000 2-949 2.858 2 2°964 4 
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The process is clearly convergent, the actual values being x = 2, 
y=l,andz = 3. 


13.3.3 The Gauss-Seidel method 


This method is a modification of the Gauss—Jacobi method. The set 
of equations (7) are written in the form 


xt) = (ῤ, — apy” = 4432) ay, 
VOD τῷ (bg — gy ΧΕ) — Aggz\) 658 (13) 
Ζ τ) = (bg — gy x"tY — ago yt) 458 


which takes the place of equations (8) of section 13.3.2. 

The procedure starts as before with an initial estimate x, y, and 
2.9) and then x is found from the first of the equations (13), y“ from 
the second and z“ from the third. It will be noted that this calculation 
for the second estimate differs from the previous method in that the 
current values of the variables at each stage of the iteration are used 
in proceeding to the next stage of the iteration. The Gauss—Jacobi 
iteration proceeds by whole steps whereas the Gauss-Seidel method 
proceeds by single steps. The advantage of the latter is that convergence 
is more rapid but there is the disadvantage that the calculations are 
not so simple to perform. 


Example Starting from the set of equations (10) with the first estimates 
xO) == yO == 7) = 0, solve the equations (9). 


Solution The following table may be formed using equations (13), 


Iteration 0 1 2 3 4 5 
x 0 2-250 1.8813 1.99724 1.997 86 1-999 83 
γ 0 —0:300 0:9075 0:97294 0:996 88 0-999 37 
Ζ 0 2.650 29296 2:99006 2.998 25 2-999 73 


In five iterations the answer is close to the exact result of x = 2, 
y = 1, and z = 3. The result of the third iteration is better than that 
of the fourth iteration using the Gauss-Jacobi method. 

Both the iterative methods are suitable for use with a high speed 
computer. 


13.4 NUMERICAL SOLUTION OF DIFFERENTIAL EQUATIONS 
The numerical solution of a differential equation of the nth order is 
usually obtained by solving an equivalent system of » first order 
differential equations. Consider for example, a general third order 
differential equation expressed in functional form 


ae) a 
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Using the substitutions dy/dx = z; dz/dx = w means that the equation 
(14) may be replaced by the system of equations 


dy 

7 2(x, y) 

dz 

i w(x, γ, Z) (15) 
Ww 


ak = f(x, y, 2, W) 


The solution of (14) may then be considered as equivalent to solving 
the three simultaneous equations (15). It is necessary then to discuss 
the numerical solution of a general first order differential equation of 
the form 


13.4.1 Euler’s method of solution 
Consider the equation 


d 
a = f(x, y) (16) 
together with its boundary conditions 
γ) Ξε )γο; X = Xo (17) 


Suppose the solution of (16) is represented by a function 


y = φ0), ¥ = Xo) = Yo» 
then the solution would be represented graphically by the integral 


curve passing through the point (xo, Yo). Such a curve is illustrated in 
Fig. 13.7. If the function ¢(x) can be found analytically then the 
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equation (16) is solved and there is no need for a numerical solution. 
In practice however the function ¢(x) may not be available so that the 
Fig. 13.7 is simply a theoretical solution curve. If x,(=x 9 + 4) is the 
abscissa of a point P on the solution curve near to (Xo, Yo) then the 
actual ordinate of P is d(x,) or φ(χο + A). 

The Euler method of solution is a simple computation that gives an 
approximation to the actual value φίχο + ἢ). Analytically the approxi- 
mation is to take the first two terms in the Taylor expansion of 
φίχο + 1) (see section 5.2), i.e. 


Hx + ἢ) τε Plo) + ἀφο = Yo +h (2) 


Xo Yo 


This value is then taken as the value y, corresponding to x, i.e. 
Vi = Yo + hf(Xos Yo) (18) 


Graphically the process is to draw the tangent line to the integral 
curve at the point (Xo, yo), as in Fig. 13.7, and to construct the point 
Px where the tangent line intersects the ordinate at x, = x) + ἡ. For 
this reason the Euler method is sometimes called the tangent method of 
approximation. The whole process may now be repeated starting from 
the value (x;, γι) to compute the next approximation given by (Xa, ya) 


Fig. 13.8 
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where yo= ji + Aft, yr); Xy = Xo + h, X_g = Xo t+ 2h (19) 


The successive points (Xo, Yo), (X1, ¥1), (%2, V2) are then approximations 
to the actual points on the solution curve through (xo, yo) as in Fig. 
13.8. 


If y, is the approximation to the solution of (16) at the point 
Xp = Xo + nh starting from the point (xo, yo), then the next point is 
(Xn+1 Vn ww such that 


Vn+1 — Ya + Af(%ns Vn) (20) 


Example (i) Find an approximation for y(0-3) if 


dy|dx = 2x|y 
and y(0) = 2. 
Solution 
Here 
S(x, y) = 2x/y and xo = 0, yo = 2 
Choosing h=01, 
then x, = 01 
and Yi = Yo + hf (xo; Yo) 
= 2 + (0-1)(0) 
then χε = 0-2 
and Yo =i + Af(%r, i) 
= 2 + (0:1)(0-2/2) 
= 2:01; 
then X3 = 0-3 
and Ya = Yo + hf(xe, ya) 
= 2:01 + (0-1)(0-4/2-01) 
= 20299. 


Hence y(0-3) is approximately 2-03. 

The differential equation of Example (i) can be solved analytically by 
separation of variables (see 11.1.2) and the solution is y?/2 = x? + A. 
With y(0) = 2, then A =2 and hence the particular solution is 
y? = 203 + 2). The actual value of y(0-3) is thus »/4-18 or 2-045 to 
three decimal places. The solution of Example (i) may be written in 
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table form as follows 


n 0 1 2 3 
Xn 0 0-1 0-2 0-3 
Vn 2 20 2:01 2-03 

h 0-1 0-1 0-1 


S(%a: yn) | 0 0-1 0-199 


Vn+i 20 2-01 2:0299 
Example (ii) Find an approximation to y(0°6) if 
d 


and y(0) = 0. (Use an interval 4 = 0-2). 


Solution Setting the solution out in tabular form and working to three 
decimal places 


n | 0 1 2 3 
gs ag ee ee τες 
Xn 0 0-2 0-4 0-6 
Yr 0 00 004 0120 
h 02 02 0-2 
αι Jd) = te te 0 0-2 0-402 
Vast = Yn + Af (%ns Vn) 0 0:04 0-120 


The result is y(0-6) = 0-120. 

There is no simple analytical solution to the differential equation of 
Example (ii), but this is the differential equation of Example (iii) of 
section 11.7 and Fig. 11.7 indicates the solution using isoclines. When 
x = 0-6 on the relevant solution curve (4) which passes through (0, 0), 
y is approximately 0-2. On this curve, it is seen that y increases rapidly 
with x and it is to be expected that Euler’s method would seriously 
underestimate the true value of y(0-6). 


13.4.2 The Improved Euler method 


The Euler method of 13.4.1 is not very accurate, and the Fig. 13.8 
illustrates how the successive approximations (x, Vi) (Xa, V2), (X35 Ya) 
can deviate from the correct positions on the integral curve through 
(xo, Yo). An improvement to the basic Euler method is the following: 
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(i) Use the basic Euler method as in Fig. 13.9 to estimate the value 
of y at x; (=Xo + A). Call this point (x, y1”). 

(ii) Use this latter result to estimate the value of the slope of the 
integral curve at the point (x;, y1°). Call this yy’. 

(iii) Compute the average of the slopes of the integral curve at 
(Xo, Yo), 1.6. Yo’, and the integral curve at (x, yi"), Le. yy’. 

(iv) Use the basic Euler method at the point (xo, yo) with the average 
slope value, i.e. $(yo’ + yr’), to estimate (%4, yi). 

(v) Repeat the whole process to estimate the next point (x2, ya). 

In the Fig. 13.9 the ordinate length y, is given by 


γι = Yor shlf(xo, Yo) + f(x, 179] 


slope $(yjt γὴς- “; 
ΒΡ αν οἱ 


΄ 
“ε 


Fig. 13.9 


The formula corresponding to (20) is 


Yn+1 = Vn Ἕ ἐλ{{(,. Vn) + f(Xn+1 rar (21) 
where γῆκει = Vn + f(a, Va) (22) 
and Xn = Xo + nh 


The superfix P, as in y,”, is used to indicate the predicted value of the 
ordinate y,. This value is then corrected by using the average of the 
slopes. The technique is a simple case of the predictor-corrector method 
of finding an approximation. 


Example (i) Repeat Example (i) of 13.4.1 to find y(0-3). 
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Solution 


Here dy/dx = f(x, y) = 2x/y and x» =0, yo=2. With A=0-, 


then x, =O, γι} = 2+ (0-1)(0) =2 

and f(%1 γι) = 20-12 = 0-1. 

Hence Vi = 2 - (0-10 + 0-1)/2 
= 2005. 

Then Xp = 0-2, yo? = 2005 + (0:1)(0:2/2-005) 
= 2015 

and S (X25 Yo?) = (0°4)/2-015 = 01985. 

Hence Ya = 2-005 + (0-1)(0-0997 + 0-1985)/2 
= 2020. 


Then χα 0:3, yg? = 2020 + (0-1)(0-4/2-020) 


= 2:020 + 0:0198 


= 2-0398, 

and Ya = 2020 + (0-1)(0-198 + 0-6/2:04)/2 
= 2:0446. 

Hence, y(0-3) = 2045 


This result is ν 4:18 correct to 3 decimal places. 


Example (ii) Repeat Example (ii) of 13.4.1 to compute y(0-6). 


Solution 


Here dy/dx = f(x,y) = x+y? with x) =0, Yo = 9. The calcu- 


lation is set out in tabular form which is self-explanatory. 


n 0 1 2 
Xn 0 02 04 
Vn 0 0:02 0-080 0-183 

h 02 02 02 

S%ns Yn) = Xn A Vn? 0-0 0:2 0.406 
Xn43 02 04 06 

Var = Vn + Af(%ns Yn) 0-0 0:06 0:16] 

SI Xnsr Vea) = χὰ + (VP, 3)? 0-2 0-404 0-626 


Yns1 = Yn Ὁ Fhlf%ns Va) +f%n+1 VF 41)] | 0-02 0-080 0-183 
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The improved Euler method gives y(0-6) = 0-183 as the solution of 
dy|dx = x + y?, y(0) = 0, a result which should be a better estimate 


than the value 0-120 of 13.4.1. 


Example (iii) Use the modified Euler method with ἢ = 0-1 and work 


to two decimal places, to estimate y(4) given that 


Ζ 


ieee ae y(0) = 0. 


Solve the equation Rae and hence obtain an estimate to e. 


Solution 


Here f(x, y) = 1 + y and the result is tablulated below 


n | 0 1 2 3 4 5 
Xn 0 0-1 0-2 0.3 0.4 0.5 
Vn 0 0:10 0:22 0-35 0-49 0-65 
h 0-1 0-1 0-1 0-1 0:1 
S(Xns Yn) 1-0 1-10 1-22 1:35 1-49 
Xne1 0-1 0:2 0:3 0.4 0:5 
yeaa 0-1 0:21 0:34 0-48 0-64 
S (Xn egy} 11 1:21 1:34 1:48 1-64 
Yn+1 0-10 0-22 0-35 0-49 0-65 
Euler’s improved method gives the estimate 
yh) = 065. 
Analytically, o =I+y 
x 


dy 
or [---[4 


and hence log (1 + y) = x + constant. 


The solution is 1 + y = Ae’, where A is a constant, and since 
y(0) = 0 it follows that A = 1, 1.6. y =e? — 1. Using the calculated 


estimate 
οὐ Ξε 1+ y(4) 
ΞΕῚ τ 0-65 
= 1-65 
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and hence, e = (1-65)? + 2:72. 
This is the actual value of e, correct to two decimal places. 


13.5 ERROR AND PROPAGATION OF ERROR 


It has been assumed in the preceding sections that the supplied data 
are exact. For example the coefficients of equation (6) of 13.3.1 were 
assumed to be precisely those numbers. In any practical problem it is 
likely that such coefficients have been obtained as the result of an 
experiment, for example as the readings on a meter. The data are not 
exact but are approximations to some probably unknown values. Most 
engineering measurements however precise they may be are subject to 
some form of approximation and it is necessary to know how the use of 
the approximation, instead of the exact value, affects subsequent 
calculations performed with the data. 

Suppose the exact (but possibly unknown) value of some quantity is Ν᾽ 
and the approximation to this value is n, then the difference between 
nand N is called the absolute error in n, i.e. if this error is denoted by 


é then Her we (23) 


13.5.1 Errors 
Errors may be introduced in many different ways; 


(i) Human error is obvious and is particularly common, 

(ii) Round-off error is due to the need to express a number in 
decimal form and with a convenient number of decimal places. 
It is often introduced into a calculation when an exact fraction 
is written in decimal form. For example, the numbers 0:3, 0-33, 
0-333 are successive approximations to the exact number 1/3, 

(iii) Errors may be introduced due to the nature of the method of 
computation used, 

(iv) There may be errors inherent in the data being used, due perhaps 
to experimental measurement. 


The errors due to (i) may be reduced by checking and cross-checking 
and those due to (iv) may need a full statistical analysis. The chief 
concern here is with errors due to (ii). 


13.5.2 Round-off errors 


The number 130/63 can be expressed as a non-terminating decimal 
2-063492063.. . . In order to use such a number in a computation it 
will usually be written as 2-06, 2-0635, 2-063492 with successively more 
accurate approximations. The approximations are the result of rounding- 
off to 2, 4, or 6 decimal places. Sometimes the term significant figures 
may be used instead. The three approximations above would then have 
3, 5, and 7 significant figures respectively. 

Example (i) The numbers 12-32, 0-0134, 125-0 are said to have 4, 3, and 
4, significant figures respectively. 
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It is useful to represent numbers in the form M x 10", where 
1 <M < 10 and mis an integer, to indicate the number of significant 
figures. Thus in the above example the numbers are respectively 
1.232 x 101; 1:34 x 1072; and 1-250 x 10?. All the numbers retained 
in M are significant. Note that the zeros of the second number are not 
significant whereas the zero of the third number is significant. In the 
former case the zeros merely locate the decimal point but in the latter 
case the zero serves to distinguish the number from any other number 
such as 125-4 or 125-9. 


Example (ii) Express 438-968 correct to 2 and 1 decimal places. Also 
express the number correct to 4, 3, 2, and 1 significant figures. 


Solution 
The number is 438-97 to 2 decimal places 


and 439-0 to 1 decimal place. 

The number is 4-390 x 10? to 4 significant figures, 
and 4-40 x 10? to 3 significant figures, 
and 4-4 x 10? to 2 significant figures, 
and 4 x 10? to 1 significant figure. 


The round-off rule to ἡ significant figures is to retain » significant 
digits and discard the remainder. If the discarded number is 


(i) less than half a unit in the mth place leave the nth digit unchanged, 

(ii) greater than half a unit in the nth place increase the nth digit by 1, 

(iii) exactly half a unit in the mth place, add either 0 or 1 to the nth 
digit to make it an even number. 


Example (iii) The number 13-1535 is 13-154 to 5 significant figures; 
13-15 to 4 significant figures; and 13-2 to three significant figures. 
Example (iv) Express 14-2545 to 5, 4, and 3 significant figures. 
Solution Rounded-off to 5 significant figures the number is 14-254, the 
discarded remainder being exactly } unit in the Sth place. 

Rounded-off to 4 significant figures the number is 14-25, the discarded 
remainder 0-0045 being less than } unit in the 4th place. 

To 3 significant figures the number is 14-3, the discarded number 
0-0545 being greater than ὁ unit in the 3rd place. 


13.5.3 Absolute, relative, and percentage error 
The absolute error has already been defined in equation (23) as 
é =n — N. If only the magnitude of the error is important then 

je] = |n — N| (24) 


is called the absolute error modulus. 
The absolute error is not necessarily a good indicator of the error. 
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For example, two measurements may be one metre and one kilometre, 
each with an absolute error of one centimetre but it is obvious that the 
former error is more significant than the latter. A better indicator in this 
case is the relative error r given by 


r=e/N (25) 
In the above illustration the relative errors are 
r= 1/100 = 0-01 (for N = 1 metre) 
and r = 1/100000 = 0-00001 (for Ν = 1 kilometre) 


In practice the number N may not be known but only its approxi- 
mation ἡ, in this case the equation (25) is written 


r=e/N 
e[(a — ε) = (e/n)(1 — εἰμ)" 1 
(e/n\ + efn + ει +...) 
Thus a good indicator of the relative error is 
r=eln (26) 


The percentage error is simply 100r. The percentage error for an absol- 
ute error of 1 centimetre in one metre is 1% and for one centimetre in 
one kilometre is 0-001 %. 


13.5.4 Accuracy 


This is not simply the absence of error. For example a clock may be 
said to be accurate if it does not gain or lose more than, say, 3 seconds 
in 12 hours. In any 12-hour interval the absolute error ¢ of the clock 
is then such that —3 Ξε < +43, ie. εἴ lies within a band of values 
(—3, +3). This band is called the error interval. If the clock was 
correct at midnight then the accurate time T hours, when the clock-time 
is t(<12 hours) would be in the range 


T = t + 3/3600 or t — 3/3600 <T <t + 3/3600. 


Example Suppose the number 2:67 is the result of a rounding-off to 
two decimal places. The absolute error bound is 0-005, i.e. the correct 
number N lies within the band of values 


(2:67 — 0005) to (2:67 + 0-005) 
ie. 2-665 < N < 2-675 


13.5.5 Propagation of errors 


The basic algebraical operations are those of addition and subtraction, 
multiplication and division and these lead to propagation of errors. 
Errors may also be propagated by the use of other functional relation- 
ships, indices, powers, roots, etc. 
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(a) Addition and subtraction 
Suppose N, and N, are exact, then Ng where 
N3 = Ny + Ne 
is also exact. If m, and 5 are approximations to N, and Nz with absolute 
errors εἰ = 1, — N, and ες = nz — Nz then ng is the approximation 
to Ng with an error €, given by 
€g = Ng — Nz 
= (m + mg) — (Mi + No) 
= (m, — My) + (τς — No) 
i.e. &g = & + ξὰ (28) 
Thus, in an addition (subtraction) the absolute errors are added 
(subtracted). 
In many cases it is sufficient to use the absolute error moduli [ε}], 
[64], and |eg| and the result 
[69] < [1] + [ea]. (29) 
The result (29) simply states that the worst possible error will occur 


when the errors are additive. For example, in a subtraction the errors 
are additive when they are of opposite signs. 


Example (i) The two numbers 59-82 and 0-56 have been obtained by a 
rounding-off process. Estimate the error in the addition 


59-82 + 0:56 = 60:38 


Solution The maximum error modulus in 59-82 is |e,| = 0-005 and the 
maximum error modulus in 0-56 is |e2| = 0-005 also. If the error 
modulus in the addition is |es|, then 


jes] «-- 0005 + 0-005 = 0-01. 


The result of the addition is then 60-38 + 0-01, i.e. the second decimal 
place is in doubt. A more meaningful answer would be 60:4 correct to 
3 significant figures. The relative error is 0-01/60-38, i.e. 0-0002. 


Example (ii) The numbers 70:03 and 69:25 have been rounded-off to 
2 decimal places. Express the difference of the two numbers to a 
meaningful number of significant figures. 


Solution The difference 0-78 has an error €3 and 
[4] < 0-005 + 0-005 = 0-01 


The last digit is again in doubt and the difference lies in the band of 
values 0-77 to 0:79. Consequently the result is 0-8 correct to I significant 
figure. Notice that the original numbers are of comparable size and 
each has 4 significant figures but the difference has only 1 significant 
figure. There can be a large loss of significant figures in a subtraction 
and this can be a source of serious error in an involved calculation. 
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(b) Multiplication and division 
Suppose that nj = n,n, is an approximation to the exact result 
Nz = N,N, and that 
δὶ ΞΞ ἥ1ι — Ν, 
δ = Ng — No 
€g = Ng — Nz 
It follows that 
δ = ἥᾳ — Ng 
= NyNz — Ν ΝᾺ 
= MNg — (μι — &)(M2 — ε3) 
= MyNz — (NyNg — M2 — Eghy + 8183) 
= Ng + Egl, — διδῶ 
If, as is frequently the case, 818 is small compared to the other term 
then 
&3 ΞΖΞ δι + Eqhy (30) 
is a measure of the absolute error involved. Dividing both sides of (30) 


by 3(== 1,72) then an estimate of the relative error involved in the 
multiplication is given by 


δ δι)  Eqhy 

Mg My Ming 
i.e. rg=n tre (31) 
Similarly, in a division, suppose that ny = πῃ (πα is an approximation to 
Nz = ΝΙΝ, then 

&3 = Ng — Ng 
πε π — ΝΙΝ, 
Ξε πη{π2 — (πα — &)/(ng — £2) 
= πχίπ — (πῃ {π2}{(1 — εὐ[π0}(1 — δα π2) 1] 
= πιίπα — (πι{π4}{] + εαίπα -- [my — &€2/mnz +...) 
(no/na)[er/ny — €2/ne]) +... 
An estimate of the relative error in a division is given by 
rg =r, — re (32) 


Thus in a multiplication (division) an estimate of the error is obtained 
by adding (subtracting) the relative errors. 

In many numerical calculations it may not be known whether the 
error is positive or negative and the only meaningful error measure is the 
absolute error modulus [ε[ or the relative error modulus |r|. The two 
estimates (31) and (32) are frequently replaced by their cruder, but 


Ι 
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simpler, forms 

Iral < |r| + [γα]: (33) 
Note that |r, + ral <|n| + [γα and |r: — rel < |r| + |re| (See also 
Section 10.3.4). 
Example (iii) The numbers 1-3 and 2-4 are the result of a rounding-off 


process. Give the product 1-3 x 2-4 to a meaningful number of signi- 
ficant figures. 


Solution If the numbers are assumed accurate then 1:3 x 2:4 = 3-12. 
Due to the rounding-off process each number has an absolute error 
modulus of 0:05. The relative error moduli are 


In| = ἜΣ = Ο038 and [χω] = Σ 
respectively. From (33) an estimate of the relative error modulus of the 
product is Iral <|ral + [ral = 0-059 
The absolute error modulus is then estimated as 

[64] < 0-059 x 3-12 = 0-184]. 
The product N = 1-3 x 2-4 has an estimated value given by 
3:12 —0:18 <N «-.-3512 4 0-18 
i.e. 2:94 < N < 3-30 


The result of the multiplication to a meaningful number of significant 
figures is thus N = 3. 

Instead of working in terms of the relative error (with the simple rule 
of addition (subtraction) in the case of multiplication (division) it is 
often convenient to work with the absolute error. 


= 0-021 


Example (iv) Estimate 3-2/0-411 if the numbers are the result of round- 
ing-off. 


Solution The quotient is N and 
3.2 + 0-05 
0-411 + 0-0005 
_ G2 + 0-05)(1 + 0-0012)-* 
" 0-411 
= (3-2 + 0-05)(1 F 0-0012. . .)/0-411 


or N = [32 + (0:05 + 3-2 x 0-0012)]/0-411 (for maximum possible 
error). 

Hence, N = 7786 + 0-131 and N has a value in the range 7-66 to 
7-92. Thus it is possible to claim that N has the value ὃ correct to 1 
significant figure. 


N= 
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(c) Functional relationships 


Suppose that πὸ = f(m) is an approximation to the exact result 
Nz = ΚΝ) where f(x) is some functional relationship of x. If n, has 
an absolute error ¢,, and 2 an absolute error ¢, then 


δ = Ng — Nz = f(m) — ΚΝ) 
= f(m) —f(m — 4) 


1 
= fm) — Yeo) — afr) + Fafa). - .] 
assuming the Taylor expansion of section 5.2. Thus 


δ = &,f'(m) (34) 
Example (v) If 1-377 is the result of a rounding-off to 4 significant 
figures, estimate the absolute error modulus of log 1-377 and hence give 
the result to an appropiate number of figures. 
Solution From (34) 
[ει = 0-0005 
and Sf) = 1/1-377. 
The estimate for |¢,| is then 
|€2| <= (0-0005)(1/1-377) 
==0-0004. 
Using 4 figure tables 
log 1-377 == 0-3200 + 0-0004. 


The result is 0-320 correct to 3 significant figures. 


13-6 ERRORS IN ITERATIVE PROCESSES 


An iterative process for calculating a root of f(x) = 0 is often written 
in the form 


cree 16 (35) 
For example, in 13.2.2 F(x,) is 6(x,) where 
x — $(x) = 70) 
while, in 13.2.3, F(x,) is the expression given by (4), i.e. 
FXn) = Xn — (αὐ. 


Suppose the absolute error in the iterate x, is ¢, then, if X is the correct 
value, (35) leads to 


X + fn41 = F(X + €,) 


1 
= F(X) + F(X) + FF +... 
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which, with ¥ = F(X) reduces to 
Ener = δ (X) + 46,” ΕΓΔ. 
For the iterative method of 13.2.2, F(X) = ¢(X) 


ie. F(X) =4(X) 
and Ensi  EnP' (X)  ἐε, Φ'(Χ) 
i.e. Ena1 = €,6'(X). (36) 


From (36) it may be noted that if |¢’(X)|<1 then 
lensil < [En 


and the error decreases as the process proceeds i.e. the process is 
convergent. 
For the iterative method of 13.2.3 


F(X) = X — ΚΧῚ Γ(ΔῚ) 


and hence 
F(X) = 1 —f(OIF (XY) + fOPCOKS OY 
Ξ- κα (Δ 
Ξε 0 


since Χ(1) = 0. 
It follows that 


FUX) = fCOPCOKF OVP 


=f OIF (X) 
and so En+1 AE? (37) 
where A= f"(X)2f'(X%. 


From (37) it may be noted that the error in the (” + 1)th iterate is 
dependent on the square of the error in the nth iterate. This is one 
reason why the Newton-Raphson method of 13.2.3. is usually a 
rapidly convergent process. However the form of A also shows that if 
Κα ΚΓ (ΔΊ is too large (i.e. if f’(X) is too small and/or f’(X) is too 
large) then the process could diverge. The equation (37) shows that if 
Ag, > 1, 1.6. €n41 > &, then this divergent process is possible. For 
example if A = 15 and é 9 = 0-1 then (37) gives δ 0°15, 2, = 0-33, 
and ¢3;=~ 1-5 and the process diverges. Such a divergent process is 
illustrated in Fig. 13.6(c) of section 13.2.3. It may be possible to prevent 
this divergency by starting from a better approximation. For example, 
if A = 15 but e, = 0-05 then ε; = 0-04, e, + 0-02, and ες = 0-006 and 
the process converges. 


352 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


EXERCISE 13 
1 Show that the equation x + 2 sin x = 1 has a root between 0 and 1 
and use the rule of false position to find it to two decimal places. 
2 Show graphically that 2 cos 2x = e* has only one positive root 
but an infinity of negative roots. Use the rule of false position to 
find, to two places of decimals, the two roots of numerically least 
value. 


3 Find the root of Question 1, correct to four places of decimals, 
by the Newton—Raphson method. 


4 Show that 4x* + 3x? — 6x — 1 = e7* has two negative roots and 
one positive root, and use the Newton—Raphson method to find 
the negative root between —1 and 0 to three decimal places. 


5 Evaluate 1/7 to four significant figures by the Newton-Raphson 
method. 


6 Show that if x, is an approximation to (a then a better approxima- 
tion is given by 
χρίει = 3[2x, + a/x?]. 
Evaluate ~/10 correct to four significant figures. 
7 Use the iteration procedure, x,,, = ¢(x,), to find the positive 


root of the equation 6x = log (x + 2) correct to four decimal 
places. 


8 Find, to four decimal places, the smallest positive root of 
χϑ — 3x + 1 = 0 using (i) the iteration procedure of section 13.2.2, 
and (ii) the Newton~Raphson method. Find also the negative root 
of the equation. 

9 Use Gaussian elimination to solve the following equations 

1-23x + 0-86y + 0.302 = 3:50 
0-06x — 2:0ly — 0:26z = 2:62 
0-39x — 0:24y + 1:53z = —1-98 
Use a check column and calculate the first estimates to three 


places of decimals. Calculate the residuals and form the second 
estimates. 


10 Solve the equations of Question 9 using (i) the Gauss—Jacobi 
method, and (ii) the Gauss-Seidel method. 

11 Obtain x, X2, x3, and x, correct to three places of decimals by the 
Gauss-Seidel method, given 

20x, — X2 + X3 -- xX, = 

Xy — 40%. + 4χΧ35 Ὁ X= 

—xX,+ Xx2+ 10x, ΞΞ 

Χο -- x3 + 8x, = —5. 


HR Ὁ οο 
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Use Euler’s method with step length 0-2 to solve 
dyjdx+y?=0, y(0)=1 


for values of x up to x = 1 (work to two decimal places). Integrate 
the equation and compare the approximate value of y(1) with the 
actual value. 


Solve numerically the differential equation 
dyldx = xy + y? 


with initial value y(0) = 1 using the improved Euler method. Use a 
step size of 0-1 for x up to 0-4. Work to three decimal places and 
give the answer to two decimal places. 


Solve numerically the differential equation 
dyldx = x — y? 


with initial value y(0) = 1, using the improved Euler method with 
step size 0-2 for x up to 1-0 and calculate to three decimal places. 
From a graph of the results find approximately the minimum value 
of y and the value of x at which y reaches this minimum. 


If all the numbers are rounded-off find the maximum possible 
error of the following calculations and give the answer to an 
appropriate number of decimal places 

(2:00 x 400) + 3-62 — 9-82 


If x = 0-56 is the result of rounding-off give the value of e~ 55 to an 
appropriate number of decimal places (tables give ε΄ 1:38 as 0-1864). 


14 
Numerical Methods II 


14.1 INTERPOLATION 


The reader is probably familiar with the idea of interpolation as 
applied to a table of values of a variable. For example, suppose that 
the function f(x) is tabulated at intervals of 0-1 from x = 1-0 tox = 20 
and that it is required to find f(1-05). The function f(x) may be a 
known function, but complicated, or it may be an unknown function 
the tabulation having being obtained as the result of an experiment. 
Let Χ(1 0) = 2-718 28 and (1-1) = 3-004 17 then an approximation to 
1 05) can be obtained by assuming that the graph of f(x) in the range 
1-0 <x < 1-1 isa straight line so that f(1-05) = 2-718 28 + (9004 17 
— 2718 28) = 2:861 22. 

The general process of finding a value of the function at other 
than the tabulated values is called interpolation, the simple process 
above is called /inear interpolation and is unlikely, in the example 
given, to be accurate beyond the second decimal place. 

A better approximation to the required value is obtained by using 
all of the known tabulated values instead of just the two values on 
either side of the interpolated point. This can be done by replacing 
the function f(x) by a simpler function p(x) that takes the values of 
f(x) at all of the tabulated points and from which the value of p(1-05) 
can be found by a simple computation. There are many types of replace- 
ment functions p(x), the simplest of which is a series of powers of x, 
ie. a polynomial function. Other types of replacement functions that 
are in common use are series of trigonometric functions, or series of 
exponential functions, or series of more complicated (but well known) 
mathematical functions. 

It is in this wider sense, that is in the finding of the best type of 
replacement function, that the word interpolation is often used nowa- 
days. The polynomial functions can lead to accurate interpolation 
formulae and these are the only functions to be discussed here. 


14.2 DIFFERENCE TABLE 


Suppose Vo, V1, Va, + + +> Vn is a set of values of a (known or unknown) 
function f(x) corresponding to a set of values xo, X1, X2, - - -» Xn 
written in ascending order of magnitude so that x%<x,.<... 
<Xn-1 «Φ χ,. The variable x is called the argument and the variable 
values y, are called the entries, the corresponding values being con- 
veniently displayed in table form. It is required to estimate the value 
of the entry that would be obtained for a value of the argument in the 
Tange Xp -χ < Xp. 
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Any reliable interpolation formula used to find this estimate will 
employ all the given information, that is it will be based on the values 
of the n + 1 entries. 

Instead of using the values directly it is found convenient to use 
relationships dependent on these values. The relationship y,,1, — y, is 
called the first difference of y, and is denoted by Ay,, i.e. 


Ay, = Vr+i — yr (1) 
The relationship Ay,,, — Ay,, being the difference of the first difference 
of y,, is called the second difference of y, and is denoted by A(Ay,) or 
A?y,, i.e. 


A’y, = Δ»: = Ay,. (2) 
For example, 


A*yo = Ay, — Ayo 
= ΟΣ - yy) -- Οἱ - Yo) 
= ya 2y1 + Yo; 
and similarly, A*y, = γᾳ — 2yo + yi. 


In general terms 
A?y, = Vr42 — Wear + Vrs r=0,1,2,...,.n—2. (3) 
In a similar manner the third differences may be formed so that 
Aty, = A%y,,1 ~ Aty, 

= (r43 -- 27}... + Yr41) — Ore2 -- 2yr44 + yr) 

= Vra3 τ 3»,ρ.ω + 341 — es P= 0,1,2,.. 2-3. (4 
Clearly, successive differences may be calculated, each difference 
being dependent on a selection of the given entries yo, V1, Yay.» +» Yas 


x y Ay ΔΎ Ay ΔΎ Ay 


Xo Yo 
AYo 
2 
x y Av 
Ay δ 
2 
x2 Y2 dy, Ay, 
3 5 
Ayo Δ Y, Δ Yo 
2 
x3 ¥3 Δ Y2 Ay, 
3 
Ay; ΥΩ 
x4 Ya Ly, 
AY4 
xs Ys 


Table 14.1 
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The polynomial interpolation formula, dependent on the n+ 1 
entries, may then be expressed in terms of these differences. For 
convenience the differences are set out in tabulated form, entries in 
successive difference columns being placed in the table mid-way 
between the two entries used in forming that difference as in Table 
14.1. 


Example Form the difference table for the function 
y=xt-2 
for x = 0 to x = 6 at intervals of 1 unit. 


Solution The values of y are tabulated and successive differences found 
as below: 


x y Ay Ay Avy 


—2 
1 
1 -ἰ 2 
3 0 
2 2 2 
5 0 
3 7 2 
7 0 
4 14 2 
9 0 
5 23 2 
11 
6 34 


In this example all the differences after the second are zero. It is no 
coincidence that the second differences of a second degree polynomial 
are the same. For, consider a polynomial of degree s, i.e. 

Y= α,χ' + aux t+... Fax + a 


and suppose a difference table is formed for equal intervals h of the 
argument x, then x,41 = x, + 4 and 


Ay, = Vra1 — Ve 
= [a(x, + AS +...J— lant...) 
= σι(χϑ + sxsth+...—x)+... 


a,shxs~+ + terms in lower powers of x,, 
and 
fag ἘΠΞΞ Ay, +1 = Ay, 
== a,sh[(x, + hy τ xs J+... 
= a,s(s — 1)h?x8-? + terms in lower powers of x,. 
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Each successive difference is a polynomial of degree one less than its 
predecessor. Finally, 
Asy, = a,(s!)h° 

= constant. (5) 
The equation (5) shows that the sth difference of a polynomial of 
degree s is constant and it follows that all subsequent differences are 
zero. The converse may be used to deduce a polynomial relationship 
as in the following example (see also example 2 of Exercise 14). 


Example In the following difference table 


x y Ay δὲν A’y 
0 4 
19 
Ι 23 18 
37 6 
2 60 24 
61 6 
3 121 30 
91 6 
4 212 36 
127 6 
339 42 
6 50g [45 


the third difference column is constant and there is an exact poly- 
nomial relationship between x and y of degree three in x. The observant 
reader will see that y = (x + 2)? — 4. 

It is possible for the values of the argument x to be given at unequal 
intervals and in this case the above tables are modified by employing 
divided differences. Only equal intervals of the argument are considered 
in this book. 


14.2.1 Errors in a difference table 

The first step in any interpolation procedure is the formation of a 
difference table from the given data and since this involves a large 
number of additions, subtractions, and entering of results it is a source 
of numerical error. A simple check can be obtained by adding the 
entries of any column of differences and comparing the result with the 
difference between the last and first entries of the previous column. 


For example, adding the entries of a second difference column 
A?yo + A®y, + A?yn +... + AYys 
= (Ayr -- Ayo) + (Ay2 — Ayr) +... + (Aye — Ays) 
= Ay, — Ayo. 
An error is frequently made in the entry column y and care must be 


taken when transferring the data from its given state (e.g. as the 
result of an experiment) to a difference table. Suppose that the entry 
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ys Of Table 14.1 is subject to an error ¢, then Table 14.2 shows how 
this error is propagated throughout the table. 

The error in (say) the third difference column appears in four terms 
with error coefficients given by the binomial coefficients of (1 — ¢)*. 


x y Ay ay Ay 
Xo Yo 
do 
xy y Δῖγο “το 
ay “μὰ 7 By + € 
xe eee Ly, τε 
“τὰ Δ,» + € 2 by, - 3e 
Xs Yar Δ, - δε 
ee ε ‘ By. + 3 
x Ya PSE Sie ys + € 
Δ ον Ὁ Bie 
Xs 75 i by, Ῥξοδο ἢ ἣ 
Ays ἘΞ 
x6 % 
Table 14.2 
Example Find the errors in the following table. 
x y Ay A’y A®y Error in A®y 
0 34:8 
38:6 
1 73:4 21-6 
60:2 4:8 
2 133-6 26:4 
866 4:8 
3 220:2 31:2 
117:8 4:3 = 48 — 0:5 
4 338-0 35:5 
153-3 6:3 = 48 + 1:5 
5 4913 ------- 418-—----- 
195-1 3:3 Ξε 48 -- 1:5 
6 686-4 45-1 
240-2 5:3 Ξε 4.8 + 0:5 
7 926-6 50-4 
290-6 3:8 
8 1217:2 55:2 
345:8 
9 1 563-0 
(1 528-2) (307-2) (32:6) 
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Solution Adding each difference column and checking against the 
difference of the first and last entries of the preceding column reveals 
an error in the last column, i.e. 55:2 — 21-6 = 33-6. Recalculation of 
A%y shows that the last entry should be 4-8 and not 3-8. The appearance 
of the A®y column suggests now that all the entries should be 4:8. 
The values 4-3, 6-3, 3-3, and 5-3 are 4-8 — 0:5, 4-8 + 3(0:5), 4-8 — 3(0-5), 
and 4:8 + 0:5 respectively. Comparison with Table 14.2 shows that 
there is an error ε = —0-5 in the entry 491-3. The corrected value 
is 491-8. It is left to the reader to complete the corrected table. 


14.3 THE NEWTON-GREGORY FORMULAE OF 
INTERPOLATION 


Let yen ) παι». + +» V-1s Yoo Vis + + +» Vn-1» Yn be values of the entries 
corresponding to equally spaced values x_n, X-nsiy sy Xa, Xo 
X14, ++ +) Xn-1, Xn in the argument. The unique polynomial function 
y = p(x) satisfying any m + 1 successive values is of degree n and 
could have the form 


P(x) = Ao + Ayx + Ax? +... 4 A,x" (6) 
the n + 1 unknowns A, being found from the n + 1 linear equations 
obtained by substituting the corresponding x and y values in function 
(6). 

14.3.1 The forward interpolation formula 


Suppose the » + 1 successive values are (x;,),), i= 0,1,2,...,7 
then it is more convenient to write function (6) in the form 


P(X) = ay + (x — Xo)ay + (ἡ — χρὶα — xan +. 
2 (ἃ -- XX — 1). . 2 (ἃ — Xp ,)α, (7) 


and to use the πὶ + 1 values to calculate the a;, Let the equal intervals 
between successive arguments be A, then 


Χ, το Xs = (r — 5). (8) 
The unknowns are then determined from the n + 1 equations 
Vr = Ay + rhay + r(r — MhPag + r(r ~— 1)(r — 2)A3ag +... 
+r(r— 1) —2)...(r το + I)h"a, 


=a t+ Σ γίν -- )γ(γ -- 2). .ὄὕ.(ν ~t4+ Data, (9) 
t=1 
withr=0,1,2,...,”. 
Forming successive differences leads to 
Ay, = )r+1 — yr 


= ha, ἘΣ (A -- 2)... t+ Dae (0) 
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A?y, = Ayrai — Ay, 
= 23. 
ἘΣ [κι -- Dre -- 1) -- 2). . .(ἡ -- -Ὁ 3)μ4α, (1D) 
ἐξ 8 
Δϑ r= Myr 1 = A’y, 
= 3. 2h8a, 
n 
+ = [et — 1I)¢ — 2) Ir — DG — 2) 
2. .r—t+4)h'a, (12) 


An-ty, = (n — 1) 185 16... + [nt — D@ — 2). . .2][rJa"a, (13) 
A*y, = nth"a, (14) 
From equations (9) to (14) it follows that 
Qo = Yo; αι = Ayo/h; ag = A*yo/2'h?; .. .; 


a, = A*yo/nth™ (15) 
and hence in equation (7) 
x—x χ — Xo)(x — x,) A? 
Poe τ ) ay, + & a oe ee 


(x — χορ -- ἊΣ . -(%—X_-1) =a (16) 


in which x is any value in the range x» < x < X,. 
If the argument is written 


Χ Ξε Χο Ἔ μὴ (7) 


where 0 <u <n, then, with the use of (8), equation (16) may be 
expressed in the form 


y = fx) = fo + uh) = po + uh) 


+ 


A2 
= yo + Ayo + uu -- NSP +... 


Ἢ uu — Du --- 2). ον ὦ ποῖ Ὁ ἢ (18) 
where p(x) is the polynomial approximating the actual (unknown) 
function f(x). An examination of Table 14.1 and equation (18) shows 
that the ‘path’ of this interpolation proceeds through the table from yo 
along the diagonal of the leading differences of yo, i.e. the formula uses 
those values forward (in the table) from yo. The formula (18) is frequently 
called the forward Newton-Gregory interpolation formula and is used 
mainly for interpolation near the beginning of the table where uw is 


Atyo 
nt} 
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small. The coefficients of equation (18) are binomial coefficients and the 
formula may be written 


n (uy 
7) ΞΞ)ο Ὁ Σ (*) A'yo (19) 


re (") is μίμ ~ 1)(u — as οὐ (ὦ πεν τῇ 1) 
r r! 
Example (i) In the following table use the result (18) to find (a) f(2-4), 
(b) f(8-7). 
x 2 4 6 8 10 
ST) 9-68 10:96 12:32 13:76 15:28 


Solution Form a difference table and note that all differences beyond 
the second are zero. 


x yp=f(x) Ay Δὲν 

2 9:68 
1-28 

4 10-96 0-08 
1:36 

6 12:32 008 
1-44 

8 13-76 0-08 
1-52 

10 15:28 


(a) From (17) with x τ 2.4, x»=2, h=2, then u=02 and 
equation (18) gives 
(0:08) 
#(2-4) = 9-68 + 0-2 x 1:28 + (0:2)(—0-8) —— +0+0... 
= 9-929 6 


(Ὁ) From (17) with x = 8-7, x9 = 2, then u = 3-35 and equation 
(18) gives 
(0-08) 
8:2 ΞΞ 9-68 + (3-35)(1-28) + (3°35)(2:35) —— 


= 14-282 9. 


Example (ii) In the following table of e? use the formula (18) to calcu- 
late (a) e° 12, and (Ὁ) 62:90, 


x ΟἹ 06 11 1-6 21 
οἷ 1-105 2 1-822 1 3-004 2 4:9530 8-166 2 
Solution Form a difference table and note that in this case there is no 


difference column that is constant (this is to be expected since e* cannot 
be represented by a polynomial function). 
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x yor Ay A’y Δὸν Aty 
0-1 1-105 2 
0-716 9 
06 1-822 1 0-465 2 
1-182 1 0-301 5 
1-1 3-004 2 0-766 7 0-196 2 
1-948 8 0-497 7 
1:6 4.9530 1:264 4 
3-213 2 
21 8:166 2 


(a) with x = 012, x» = 0:10, A = 0-5, then u = 0-04 and (18) gives 


e°-12 ~ 1-105 2 + (0-04)(0-716 9) + (0-04)(—0-96) os? 


+ (0-04)(—0-96)(— 1-96) Con) 


(0-196 2) 
24 


+ (0:04)(—0-96)(— 1-96)(—2-96) 


= 11269. 


The correct value to 5 decimal places is known to be 1:127 50. 
(b) with x = 2-00, xo = 0:10, h = 0-5, then u = 3-8 and equation (18) 
gives 


7°09 =~ 1-105 2 + 2:724 22 + 2-474 86 + 0-962 39 + 0-125 25 
= 7-391 9 (to four decimal places). 


The correct value to four decimal places is known to be 7-389 1. 

In example (i) the Newton-Gregory formula (18) for p(x) is identical 
with the function f(x), which is a quadratic function, and the results 
for f(2-4) and f(8-7) will therefore be correct to the number of decimal 
places retained. In example (ii) the function e* is replaced by a fourth 
degree polynomial which takes the value of e* at the five given entries. 
Because the successive differences decrease, the higher differences are 
relatively small so that any term in the formula has a smaller contribu- 
tion than its predecessor. For the range of values given (i.e. x = 0:1 
to x = 2:1) the quartic polynomial is a reasonable approximation to 
the actual function e?. The error involved in the estimation for 6013 is 
about 0-05 per cent and in the estimation for e?°°° is about 0:04 per cent. 


Example (iii) Use the formula (18) to estimate the value of f(2:5) in 
the following table of f(x). 


x 2 3 4 5 6 
fo 1 2 6 24 120 
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Solution Form the difference table from the given data 


x Κι) Ay Δὲν A’y Aty 
a My ΣΝ 
1 
3 2 3 
4 11 
4 6 14 53 
18 64 
5 24 78 
96 
6 120 


With x = 2:5, x) = 2, hk = 1, then u = 0:5 and formula (18) gives 
f(2:5) = 1 + 0.5 — 0:375 + 0-687 5 — 2-070 3 
= —0:257 8. 


The function f(x) in this example is well-known and is called the 
Gamma function or the generalized factorial function, f(x) = (x — 1)}. 
The value of f(2:5) is known to be 1-329 3. The estimated value is 
obviously incorrect, the reason being that the successive differences are 
increasing rapidly and the series obtained from formula (18) is a 
divergent one. The values of f(x) are increasing so rapidly that it is 
impossible for a fourth degree polynomial to approximate to the 
function even though it is exact at the given five values. In other words, 
the information provided (i.e. the given data) is too meagre to describe 
the behaviour of the function. 

Although the exponential function of example (ii) is also a rapidly 
increasing function the data are given with an interval of x small enough 
to be able to interpolate with reasonable accuracy within the range of 
the table. If here, the interval of the argument x had been too large 
(say A = 1) the interpolation formula would not have provided an 
accurate estimation except possibly at values extremely close to the 
given entries. The difference table below shows how rapidly the differ- 
ences are increasing for h = 1. 


x e7 Ay A*y Δϑν Δέν 
0 1 
1-718 3 
1 2:718 3 2:952 5 
4-670 8 5-073 1 
2 7.389} 8-025 6 8-717 6 
12-696 4 13-790 7 
3 20-0855 21-816 3 
34-5127 
4 54-598 2 


Another form of the interpolation formula may be used when inter- 
polation is required at a value near the end of the table. 
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14.3.2 The Newton—Gregory formula for backward interpolation 


Suppose that (Xo, yo) is at the end of the table then it is required to 
find a polynomial that satisfies the (γι + 1) values (x_;, y_), 1 = 0, 1,2,3, 
εν ἢ. Proceeding as in section 14.3.1 it is convenient to write function 
(6) in the form 


P(x) = δο + (x — Xo)bi + (ὁ — Xo) -- X-ibat+.. 


+ (x —— Xo)(x te Ks 1) see (x τῶ ΧΩ -2))ΡᾺ (20) 
with 
χ., το χος = —(r — s)h, (21) 


from which it follows that 
γα ἘΣ τῷ -- DT -- 2). «ον τοῦ Ὁ Dhl. (2) 
ἐπὶ 


Taking successive differences of equation (22) and using the notation 
of Table 14.3 leads to 


by = yo; hb; = Ay_1; 2\hbg = Aty_g3. . 3 nth", = Δ'γ..,. (23) 


oe ¥ Ay ὧ ay Ay 
Xn Y-0 
Ay-n 
Lent} Y-n+1 ὅν, 
x2 y-2 Δδὲγ-5 ay-4 
Ay-2 By-3 
x y-t δῦ. 
Ay-1 
xo Ὁ 
Table 14.3 


Substituting into equation (20) and using 
X =X + uh (24) 
gives the backward interpolation formula in the form 


y= f(x) = f(Xo + uh) = po + uh) 
A*y-2 Δὸν 9 
= yo + uAy_, + uu + Yay + μία + DU + 2) τοῦτ 


Aty_ 
ba fale + De +2). . (uta) (25) 
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which may be written 


~ π (μ ἜΤ -- ky 
Y= Yo ἘΣ ( r ) A ae (26) 
where (") = Men Men το rth, 


Table 14.4 shows the paths of the forward and backward interpolation 
formulae through the difference table for n = 5. 


5.5 y-8 
x4 y-4 
x3 y-3 
SK 
x O58 
4..χΧ 
x2 y-2 7 . ΔΥ 2 
Sys 
ee a 
x γι yw he 
yo Oa 
— 
Xo vn 
Ayo 
προ, 2 
xy n A Yo. 
279 
x2 Ya ~<a 
xs ¥3 
Newton — Gregory forward 
x4 ¥4 
—x——1-—— Newton—Gregory backward 
xs Ys 


Table 14.4 


Example Apply the backward interpolation formula (26) to find e?°°° 
in example (ii), section 14.3.1. 


Solution Using the difference table of that example and substituting 
into equation (26) with x = 2:00, xo = 2:10, A = 0-5, so that u = —0-2, 
leads to 

ε2 00 <= 8-166 2 — (0-2)(3-213 2) 


~ (0-2)(08) ( cade 


— (0-2X0-8(1'8) or 


— (0-2)(0-8)(1-8)(2°8) τῇ 2 


= 7.392 0. 
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14.3.3 Interpolation paths 


It has been observed (see Table 14.4) that the two Newton-Gregory 
formulae use the information of the difference table obtained by pro- 
ceeding through the table along different paths. The forward formula 
is useful when interpolating near the beginning of the table while the 
backward formula is useful when interpolating near the end of the table. 
Both formulae may be used with a reasonable expectation of accuracy 
providing that the interval ἢ in the given data is not too large and 
providing always that the higher differences decrease in value and 
become eventually negligibly small. 

Interpolation is frequently required near the centre of the table and 
the best interpolation formula in such a case will be obtained by pro- 
ceeding through the difference table by a path that is as near as possible 
to the horizontal line through the interpolated value. There are a number 
of different paths starting from (say) the value yo in the Table 14.4. 
Each path will provide a different form of the interpolating polynomial, 
the various coefficients being obtained from the differences encountered 
along and on either side of the path. 

Historically, each different formula (but all variants of the original 
polynomial (6)) was obtained separately and bears the name of its 
discoverer, e.g. Gauss, Bessel, Stirling. However, these formulae, and 
indeed any other formula, can be obtained from a simple diagram 
called the lozenge diagram together with four basic rules of derivation. 
The form of the diagram and the basic rules will be deduced from the 
two Newton formulae in the next section and then used to derive the 
other named formulae. 

The next section may be omitted at a first or casual reading and the 
reader may, without immediate loss, move to section 14.5 where these 
formulae are derived and applied to interpolation problems. 


14.4 THE LOZENGE DIAGRAM 


The Table 14.5 shows the lozenge diagram which is, in essence, the 
difference table about the point yp together with the binomial coefficients 
of the Newton-Gregory formulae displayed according to a definite 


. u— ‘ : : 
pattern. The coefficient ( ᾿ ‘) is placed immediately above the 
difference A’y,. It may be noted that (" 6 ‘) = 1 for alls. 


The two Newton formulae 
) =f(xo + uh) = po + uh) 


= Vo [ἢ δον +(5)a% + (3) a% +... + ("Jar +... 
(27) 
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and 


) =f(%o + uh) = p(x%o + uh) 


u ut] u+2 
=yo+ (I) ya πὶ 2 ) ary +( 3 ) Ary st... 


utr—1 ᾿ 
πο 


may be obtained from Table 14.5 by proceeding along the appropriate 


diagonal paths and applying the following two rules: 


Rule 1 If, on arrival at a difference entry, the slope of the step was 
negative then form the product of that difference and the binomial 
coefficient immediately above it; but, if the slope was positive then form 
the product of the difference and the binomial coefficient immediately 


below it. 


Rule 2 Add the successive products formed on proceeding through the 


table from left to right. 


x3 7.3 


Ἵ 


> 
ἣν 
ο 


Ἧ 
t 
n 
= ἕ 
-- ΩΣ 
7. x 
+ 
ἐϑ 
Ό. 
x” 
ee 


Co 
< 
Ὁ 


ut2 


ae “σι aN 4 


8 
! 
x 
c 
SS + 
D - 


“< 
4 
«§ Po. 
ND ON + 
— zh 
ἐν 
ς 
ο ἕ 
cs 
+ 
ν 


> 
τ 


--- 


8 
3 
δ 


i 


= 


= 
> 
ΝΣ 


Ν 


xy γι 


x 


MV 
γᾶν 
Wa 

ν᾽ 


8 
» 
= s 
aa 
᾿ 


Ὁ 


Table 14.5 


Example (i) Starting from y) and proceeding via the diagonal path of 


negative slope then 


u 
rans (‘Jane (Jans (Jans... 
which is equation (27). 
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Example (ii) Starting from yo and proceeding via the diagonal of 
positive slope 


+1 +2 
y= yo + (ἢ δ». ἘΣ )δν τ ("3 ) δόντι. 


which is equation (28). 


Example (iii) The backward Newton-Gregory formula starting at y, 
and using Rules 1 and 2 is 


1 
yent("7 ') ayo + (5) diya + ("3 ve ) ay. at. 


Suppose now that a zig-zag path passing through yo, Ayo, A®y_1, 
A®y_,, Δόέγ. 2, is considered, then the two rules would give the formula 


yore (Gant Gare 12.» 
ἐ{ 1 λυ τι, . (9) 


This is a valid interpolation formula known as the Gauss forward 
formula. 

Similarly taking a zig-zag path from y starting in the opposite 
direction leads to 


i 
y= κα (1) Ava ("9 Jaye (5 ') Ay. 
Ἐ{( 1} a... (30) 


which is a result known as the Gauss backward formula of interpolation. 

It is evident that Rules 1 and 2 are sufficient to provide certain inter- 
polation formulae but before it can be stated that any path gives a valid 
interpolation formula it is necessary to prove first that any two paths 
through the table with the same start and finish are identical and second 
that if any two paths, entering the table at different points, end at the 
same point then the derived formulae are identical. For example, in 
Table 14.5 a path that starts at yp and ends at A®%y_, will give a formula 
that is a sixth degree polynomial satisfying the seven values (x_3, y_s), 

. «5 (Xs, Ys) and the second requirement would show that the same 
polynomial should be found by any other path starting at (say) y. and 
ending at A®y_,. To prove the first requirement it is necessary only to 
consider the paths taken round one complete lozenge. Consider the 
paths 1 and 2 in Fig. 14.1. 
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Fig. 14.1 


The contribution from path 1 is 


u—s μ-- 5 Ὲ} 
( r Sea ( r+1 ae: 9 
and the contribution from path 2 is 
u—s u—s 
("τ 7 δον Fei) are 82 


That the contributions (31) and (32) are identical is easily shown by 
substituting 


Δ, ΞΞ A’ys-1 + Ares 8-1 
in (32) and using the fact that 


4. = u—s orm 

r+i1 Ἂν r τὶ, «1 

Since the contributions (31) and (32) are identical, then each is equiva- 
lent to 


ΤΊΣ ᾿ u—s 1 u—stil u—s ΕἾ 
2 +49] {{Ὁ yea {πὸ ἡε( τὴ] vie 
(33) 


Examination of (33) shows that the step from A’~1y, to A’**y,_, can 
be taken by the horizontal path 3 of Fig. 14.1 providing the following 
two rules are obeyed: 


Rule 3 If in a horizontal step from left to right, the path passes through 
a factor then add the product of that factor and the average of the 
differences immediately above and below. 


Rule 4 If in a horizontal step from left to right, the path passes through 
a column entry then add the product of that entry and the average of 
the factors immediately above and below. 
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Finally to justify the second requirement that the same formula is 
obtained by different paths terminating at the same point in the table, 
consider the two paths of Fig. 14.2. 


ἢ [ἢ 


Δ). erg) 
2 utt 
yv——D—{? }—-O— {3 
Ayo ( 2 ) ὩΣ ay, 
mr (7) Be 
πω (2) 
Ye 
Fig. 14.2 
The contributions are 
1 1 
Path 1: yo + $[Ay_1 + Ayo] (i) +5[("3 ) + ( A*y_y 


=1 a | 
Path 2: γι + Ι: 1 Ay, + (" 2 ) Δξγο + (9 Δὅγ.; (35) 
In (34), put 


Ay_1 = Δγο — A*y_; 
= Ayo — A’yo + A®y_i, 
and in (35), put 
Yi =Yot+ Ayo, 
Ay, = Ayo + A*yo, 


then the contributions are 


Path 1: yo + (η Ayo + ἐδῆγο ee ') - (2) - () 
ἘΚ: [ εἰ e ᾿ ee ) 


—1 — 1 
Path 2: yo + Ayo [ + (" ἢ } + Ay, "" Ι 
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The coefficient of A*y_, in (36) is 
1 E (ut lu ut Dente -, ἢ uu-— 2] 


_ a -- γὼ -- >| 


3 
5 :) 
“13 
which is the coefficient in (37). It is left to the reader to show that the 
other coefficients of (36) and (37) are identical. 


145 THE INTERPOLATION FORMULAE 


Using the lozenge diagram of Table 14.5 and the four basic rules it is 
now a simple matter to derive any interpolation formula by choosing 
a path through the table from left to right. Note that a step from right 
to left can be included if the product is subtracted instead of added. 
Of the many formulae that can be obtained, four have already been 
mentioned, the two Newton-Gregory and the two Gauss formulae (see 
(27), (28), (29), (30), of section 14.4). 

It has been noted that the Newton-Gregory formulae are useful for 
interpolation near the ends of the table. The Gaussian formulae are 
useful near the centre of the table but are superseded by two other 
formulae which proceed through the table via horizontal paths. 


14.5.1 The Stirling formula 


This is a formula obtained from the path starting at yo (see Table 14.5) 
and proceeding through the table on a horizontal path. Using rules 2, 
3, and 4, 


εἰ) eras oad[Ct) «(ja 


1 1 
£5 (" 5 ἡ Wt δὴν ὦ 


HCE) + CT Yor 


u(Ay_, + A u? u(u2 — 1) (A8 tay 
aes ya Ye) Ary. + τ )( )- -1) 
wuz — 1 
HD ayy ee (38) 


Note that this is just the average of the two Gauss formulae (29) and 
(30) of section 14.4. 
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14.5.2 The Bessel formula 

This interpolation formula is obtained by taking a path that starts 
midway between the entries yy and y, and proceeds on a horizontal path 
through the table. 


x3 Y¥-3 
N. 
mo 
“L-2 = Y-2 W-4 
me 
Ly Yt te 
a x Ave : y-Oy-3 _* Go 
ὌΠ ee ---Ξ Ὄλας Shige he 
as Jeo ye ay go τον Ξθρι 
xy γι 8p Ξ 
πο 
x2 y2 πω 
Ν, 
x3 γ3 
ΚΕΥ 
N, = Newton forward Gz = Gauss backward 
Nz = Newton backward S = Stirling 
G, = Gauss forward B = Bessel 
Fig. 14.3 
From Table 14.5, using rules 2, 3, and 4, 
1[/u μ-- 1 
= 3(Vo + yi) + 2 [(Π + ( 1 )] Ayo 
1 u+1 4u 
+3 (2)toy + a3 {{Ὁ a) ty 
μ4- 1 
ἘΣ( Ὁ ) ost δύ τὶ 
+ — 1) (A? = A? 
_ Yo tu Pay + A we )( -1 Yo) 
_— Uu .-- 
+ Mi ΕΝ Δὃν.. Pian (39) 


A particularly simple form of (39) may be used to compute the value 
of y midway between the given values yo and y,. With u = 3, the 
coefficients of all the odd differences are zero and the formula becomes 


Yot γι ἔα Aye. 3 AV kes are 


gE eR 2 128 2 
1.3.5....(2n—1)P Δϑῆγ, + A2%y_, 
[ (2n ] Yen + προς (0) 


a 2A(In)t 3 
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The result (40) is called the formula for interpolating to halves. 
The paths taken through the table to derive the named formulae of 
interpolations are shown together in Fig. 14.3. 


Example (i) Use Stirling’s formula to compute tan (89° 26’) given the 
following table of values of tan x. 

89° 21’ 89° 23’ 89° 25’ 89°27’ = 89° 29" 
88-14 92-91 98-22 104-17 110-90 


Solution Form the difference table and apply formula (38) with 
x = 89° 26’, χρ = 89° 25’, ἢ = 2’, and u = 0:5. 


x 
tan x 


x y Ay δ δν Aty 
89°21’ 88-14 
4-77 
89° 23’ 929] 0-54 
5.31 0-10 
89° 25’ 98:22 0-64 0-04 
5-95 0-14 
89°27" 104-17 0-78 
6-73 


89° 29’ 110-90 
Retaining all decimal places, 
tan (89° 26’) = 98-22 + 0°5(5-63) + 4$(0-25)(0-64) 
+ 4(0-5)(—0-75)(0-12) + #2(0-25)(—0-75)(0-04) 
= 98-220 + 2.815 + 0-080 — 0-007 5 ~ 0-000 312 5 
= 101-107. 


Example (ii) Use Bessel’s formula to estimate ¥/46-24 given the follow- 
ing table of y = ¥/x. 

x 41 45 49 53 

yx 3-448 2 3-5569 3-659 3 3-756 3 


Solution Form the difference table and use formula (39) with x = 46-24, 
Xo = 45, h = 40, and u = 0-31. 


x y Ay Δὲν Δὃν 
41 3-448 2 

0-108 7 
45 35569 —0-006 3 

0-102 4 0-000 9 
49 36593 — 0-005 4 

0-097 0 


33 3-756 3 


374 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


Then, | 
4/46:24 = 3-608 1 + (—0-19)(0-102 4) 
+ 4(0-31)(—0-69)(—0-005 85) 
+ $(0-31)(—0-19)(—0-69)(0-000 9) 
= 3-608 1 — 0-019 456 + 0-000 626 + 0-000 006 
= 3-589 3. 
Example (iii) Form a cubic polynomial that may be used to replace 
sin x in the range 0-90 < x < 1-20 given the four values 
x 0-90 1-00 1-10 1-20 
sin x 0-7833 0:8415 08912  0-9320 


Solution Form the difference table, 


x ) Ay A*y Aty 
0-9 07833 
0-058 2 
[Ὁ 0-841 5 —0-008 5 
0-049 7 —0-000 4 
1-10-8912 —0-008 9 
0-040 8 
1-20-9320 


and use Bessel’s formula (39) with xp = 1-0, ἢ = 0:1, then, 
sin (1 + 0-1u) = 0-866 35 + (u — 4)(0-049 7) 
+ ἐμ — 1)(—0-008 7) 
+ gu(u — ἐγ — 1)(—0-000 4) 
= 0°841 50 + 0:054 02u — 0-004 25u? — 0-000 07u*. 
Putting u = 10v then, 
y = 0-841 50 + 0-540 20 — 0-425v? — 0-070? 

may be used as the cubic expression to represent sin (1 + v) in the 
range —0-1 <v <0-2. Finally, replacement of v by x—1 gives the 
cubic polynomial in x that represents sin x. 
14.5.3 The accuracy of the formulae 


Each formula derived for the function f(x» + uh) is an approximating 
polynomial. If the actual function f(x) is a polynomial in x then, 
providing the difference table can be taken far enough (i.e. that enough 
data are given and used) all the formulae are equally accurate and will 
give an exact result (assuming the original data to be exact). 

If the function f(x) is not a polynomial then the difference table 
should first be examined to see that the successive differences Ay,, A?y,, 
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Ay, . . . decrease, which means that successive terms in the poly- 
nomial approximation also decrease. Some indication of the accuracy 
to be expected is often given by an examination of the difference 
columns. If the differences decrease rapidly and the rth difference 
column is approximately constant then it is reasonable to expect that 
the formulae will give results as accurate as the given data. If the 
differences slowly decrease or there is no column of (approximate) 
constants then the reader should be aware that a problem of accuracy 
does exist. When the differences increase it is highly probable that the 
interpolation is impossible, the formula used being part of a divergent 
series. Interpolation in the latter case might be possible if the differences 
increase so slowly that the binomial coefficients in the formulae are 
able to control the size of the terms and ensure a convergent series. 
The difficulty of knowing whether a result is approximately correct or 
not is usually removed by an examination of the remainder term of the 
interpolating series used. The discussion involved in this examination 
is not considered here. 

The choice of interpolating polynomial depends on the position in 
the table of the interpolated value. It is usual to keep uw as small as 
possible but to use a path through the table that gives a polynomial 
formula using all the available information. The following statements 
read in conjunction with Fig. 14.3 may be taken as general rules: 


Rule 1 If xq is at the start of the table, 1.6. y is known for argument 
values Xo, X1, X2,. . ., and interpolation is required near the beginning 
of the table then the best interpolation formula is the Newton-Gregory 
forward formula. 


Rule 2 If xo is at the end of the table, i.e. y is known for argument 
values Xo, X-1, X-2,- . ., and interpolation is required near the end of 
the table then the best result is obtained from the Newton-Gregory 
backward formula of interpolation. 


Rule 3 If xo is in the centre of the table, i.e. y is known for argument 
values on either side of x, and interpolation is required near the centre 
of the table then the best formula to use is either that of Stirling or of 
Bessel. 


If interpolation is required at either side of x) for the range 
—}<u< +H, then the Stirling formula should be used. 

If interpolation is required near the middle of the interval (x9, x,) 
for the range } <u < 3, then the formula of Bessel should be used. 


Example Form the difference table from the given data for y = sin x° 
and estimate y when (i) x = 12, (ii) x = 31, (iii) x = 37, (iv) x = 45. 
x 10 20 30 40 50 60 

y 1017365 0:34202 0:50000 0:64279 0:76604 0-866 03 
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Solution In the following difference table the decimal point is under- 


stood, the fifth difference for example being 0-000 18. 


x y Ay Δὲν δν δν AY 
Ι0. 17365 
16837 
20 34202 — 1039 
15798 —480 
30 50000 —1519 45 
14279 —435 18 
40 64279 —1954 63 
12325 —372 
50 76604 --2326 
9999 
60 8δ6603 


(i) For x = 12, take x) = 10 then, since h = 10, μ τῷ 052 and the 
forward Newton-Gregory formula may be used. From equation (27) of 
section 14.4 

0. 
y(12) = 0-173 65 + (0:2)(0:168 37) + eo 


4 a ἘΞ ῸῸ (—0-004 80) 


0-2)(—0-8)(— 1-8)(—2-8 
+ C2LK 0-18-28) (999 45 
go 2)(—0-8)(— 1-8)(—2-8)(—3:8) 
120 
= 0:173 65 + 0-033 674 + 0-000 831 2 — 0-000 230 4 
— 0-000 001 5 + 0-000 004 5 
= 0-207 93 
The result correct to five decimal places is known to be 0-207 91. The 
estimate is in error by less than 0-01 per cent. 


(ii) For x = 31, take x» = 30, A = 10, then u = 0-1 and the Stirling 
formula (38) of section 14.5.1 gives 


(—0-010 39) 


(0-000 18) 


yG1) = 05 + 0-1)(0-150 385) + SY (--0015 19) 


4 enn) (0-004 575) + eon oh) 


= 0-500 00 + 0-015 038 5 — 0-000 075 95 
— 0-000 075 49 — 0-000 000 19 
= 0-514 89. 
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The correct result is 0-515 04 and the estimate is in error by less than 
0-03 per cent. 


(iii) For x = 37, χο = 30, h = 10, then u = 0-7 and Bessel’s formula 


(39) gives 
(0-7)(—0-3) 
2 


y(37) = 0-571 395 + (0-2)(0-142 79) + (—0-017 365) 


4 ©2O2X—0) 5694 35) 


(0-7(—0-51)(— 1:3) 
+ 54 
(0:7)(0:2)(—0-51)(— 1-3) 
iF 120 
= 0-571 395 + 0-028 558 + 0001 823 325 
4+ 0-000 030 45 + 0-000 010 442 + 0-000 002 


The correct result, to five decimal places, is 0-601 81 and there is an 
error less than 0-002 per cent in this estimate. 

Using the Lozenge diagram the reader should verify the derivation 
of the sixth term in the formula. 


(0-000 54) 


(0-000 18) 


(iv) For x = 45, xo = 40, h = 10, then u = 0-5 and using Bessel’s 
formula (40) gives 
y(45) = 0-704 415 — 4(—0-021 40) + τῆν (0-000 72) 
= 0-704 415 + 0-002 675 + 0-000 017 
= 0-707 11. 


In deriving the average coefficient 0-000 72 of the third term of y(45) 
it has been assumed that A‘y, is constant at 0-000 18, i.e. the coefficient 
0-000 72 is the average of 0-000 63 and 0-000 81. The estimated value is 
correct to five decimal places. 


Finally it is possible to give an indication of the superiority that the 
central difference formulae of Stirling and Bessel have over the Newton- 
Gregory formulae by a comparison of the coefficients of the terms. 
Consider the case of u = 0-3 in the formulae (27) of section 14.4, (38) 
of section 14.5.1, and (39) of section 14.5.2. The three formulae based 
on the value x = x, are as follows: 


Newton-Gregory: 
W(Xo + 0:3h) = yo + O-3Ayo — 0-105A2y, + 0-059 SA), 
— 0-040 16Aty, +... 
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Stirling: 


Δν.. ἘΔ 
(ον ct Oo) At Yo) + 0.045A2y_, 


Y%o + 0°3h) = Yo + 0.3 


— 0-045 5 


A’y_ A®y_ 
Cyt aoe — 0-003 412 SAty_ 


ee 
Bessel: 


2 2 
TAO aI) = Ὁ ΣΙ -- (0-2)Ayo -- 0:105 Cyn + 00 


4 4 
+ 0-007A%y_, + 0019 3; Qe ia 


It can be observed that after the first few terms the coefficients in the 
Stirling formula decrease more rapidly than those of the Bessel formula 
which in turn decrease more rapidly than those of the Newton-Gregory 
formula. The contribution from the higher order terms of the Stirling 
formula is much less than that of the Newton-Gregory formula. If the 
differences are only decreasing slowly or if there are round-off errors 
in the initial data (which are magnified in the higher difference columns, 
see section 14.2.1) it is preferable, whenever possible, to use a central 
difference formula such as.that of Stirling or Bessel. 


14.6 INVERSE INTERPOLATION 


Inverse interpolation applied to a set of tabulated values (x, y)), 
i= —n,...,—1,0,+1,. . ., + requires an estimation of x for some 
given y in the range y_, <_y < y,,. In essence the process reduces to 
solving the interpolation formula y = p(x) for x, given the value of y. 
The formulae of section 14.5 have all been expressed in the form 
) = p(x + wh) and it usually easier to solve for u and then to determine 
x. The process is simply that of determining the root of a polynomial 
equation, a process discussed in some detail in Chapter 13. 

The interpolation formula used depends on the position in the table 
of the value required. Only one method of attack is discussed here 
namely the iterative method of section 13.2.2. The equation p(xo + uh) 
= constant is written in the form 


u = $(u) 
and the iterative process 
U, 41 = φίμ,) 


is used. 
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Example Find the value of x for which y = 10 in the following table 
x 2:8 2:9 3-0 3-1 3:2 
y 8-252 7 9:1146 100677 11-1215 12-2866 
Solution Form the difference table 


x y Ay A’y Δϑν Aty 
2:8 8:2527 
0-861 9 
2:9 9:.114 6 0-091 2 
0-953 1 0-009 5 
30 10-0677 0:100 7 0-001 1 
1-053 8 00106 
3-1 11-1215 0-111 3 
1-165 1 
32 12-2866 


Because the value y = 10 is near the centre value the best interpola- 
tion formula to use is that of Stirling. With x, = 3, h = ΟἹ, and 
x = 30 + 0-1u then equation (38) leads to the approximation 


10 = 10-067 7 + u(1-003 45) + u?(0-050 35) 


2 2(,,2 ... 
εὐ - ὃ oo10 05) + MHD (0-001 1), 
6 24 
which may be rewritten in the form 
0-067 7 (0-050 35) (0-001 675) 
Og pote ne 2.82 NS at ΧΟ tele BI 
“= 00345 t Δ Τρῦ345 + MY ~ DT 66345 
(0-000 046) 
24,2 ——____ 
Pee) 00345 
= φῶ. 
For the purposes of calculation, put u = —v then 


v = 0-067 467 + 07(0-050 177) + v(1 — v?)(0-001 669) 
— v?(1 — v?)(0-000 046) 
and the iteration proceeds as follows 


Vo = 0 v3 = 0-067 8106 
v, = 0-067 467 v, = 0-067 8106 
Ug = 0-067 807 


The value of u may therefore be taken as —0-067 81 from which 
X = Xo + uh = 2-993 2 to four decimal places. The table is, in fact, 
one for y = cosh x and the result correct to four decimal places is 
known to be 2-993 3. The error in the calculated value is 0-003 per cent. 
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147 NUMERICAL DIFFERENTIATION 


This is the process of calculation of the derivative dy/dx from given 
data (x;, γι), When the actual relationship between x and y is unknown. 
The procedure is to replace the true relationship y = f(x) by the best 
approximating polynomial y = p(x) and then to differentiate the latter. 
The interpolation polynomial to be used depends as usual on the 
position in the table of the value at which the derivative is required. 
The interpolation formulae obtained in the previous sections have 
all been expressed in terms of the variable u rather than x, in the form 


YX) = f%o + uh) = p(x) (41) 
and differentiation gives 
dy _dydu_idy lap 


dx ~ du dx ἡ μ΄ h die (2 
Further derivatives may be obtained, for example 
oe a ep 
Bae (43) 


Example From the data given, compute (i) dy/dx when x = 3-21, 
( d?y/dx? when x = 3.8. 


x 3-0 3:5 40 4:5 
y 1-484 30 1-550 23 1-607 46 1-658 O1 
Solution Construct the difference table 
x y Ay Δὲν Δϑν 
30 1.48430 
0-065 93 
3.35 1[:55023 —0-008 70 
0-057 23 0-002 02 
40 1-607 46 —0-006 68 
0:050 55 
4.5 1-658 01 


(i) To find dy/dx at x = 3.21 near the start of the table use the Newton- 
Gregory formula (27) based on xp = 3, h = 0:5, and u = 0-42 then, 


y = 1-484 30 + u(0-065 93) + a ) (—0-008 70) 
4 en 3.6.9 (0-002 02) 


and with (42), 
0:5 dy/dx = [0-065 93 — (2u — 1)(0-004 35) 
+ Gu? — 6u + 2)(0-000 336)],,-0-42 
= 0-065 93 + 0-000 696 + 0-000 003 
= 0-066 629. 
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Thus dy/dx = 0-133 2, when x = 3-21. 
(ii) To find d?y/dx? at x = 3-8 near the centre of the table use the 
Bessel formula (39) based on xp = 3.5, A = 0.5, and u = 0-6, then 


y = 1-578 845 + (ὦ — 4)(0-057 23) + 4u(u — 1)(—0-007 69) 
+ ¢uyu — 4)(u — 1)(0-002 02) 


and 
0-5 dy/dx = 0-057 23 + (ὦ — 4)(—0-007 69) 
+ 4(3u? — 3u + 4)(0-002 02) 


and so, 
dy 
(0-5)? Tt = [—0-007 69 + (ὦ — 4)(0-002 02)],..0.6 
2 
or [52] τς (—0-007 488)0-25 
x 1338 


= —0-029 9(52). 
The given data are those of the function 


y = logy (10x + 0:5) 


from which 
dy μὴ 
ΕΝ = 0 133 22 and I... = —0 029 30 


and the estimated value in (i) is in error by approximately 0-03 per cent 
while that in (ii) is in error by over 2 per cent. 

The process of successive differentiation has the effect of exaggerating 
the errors involved in the original approximations, i.e. if the data are 
accurate to a specified number of decimal places then the derivative 
obtained by the above process is usually accurate to a lesser number of 
decimal places. 


14.7.1 Maxima and minima 


To find the approximate turning value of a function it is necessary to 
form the approximate derivative, equate to zero and solve the resultant 
equation for μ (and hence for x). 


Example Find the turning point, given the following data. 


x 2 3 4 5 6 
y 311875 120275 = 2-865 3 3-7052 145440 
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Solution Form the difference table in the usual way. A rough sketch 
shows that there is a minimum in the region x = 4-5. 


x y Ay Δὲν Δὸν Aty 


2 31-187 5 
—19-1600 
3 12-027 5 9-997 8 
—9-162 2 0-004 3 
4 2°865 3 10-002 1 —0-007 5 
0-839 9 —0-003 2 
5 3-705 2 9-998 9 
10-838 8 
6 14-544 0 


Using Bessel’s formula (39) of section 14.5.2 with x» = 40, A = 1, 
then 


y 3-285 25 + (u — 90-839 9) + WU= ἢ em 


4 u(u — De - ἢ 


1) (10-000 5) 
(--0003 2) 


and h& 0-839 9 + (u -- 4)(10-000 5) + (Gu? -- 3u + Ὀς- —0-000 53). 


An approximation to the value of u corresponding to the minimum 
point is given by solving the approximate equation 


0-001 6u? — 10-002 1u + 4-160 6 = 0. 
Writing this in the form 
u = 0-416 0 + 0-000 16u? 


the approximate root is 0-416 from which it follows that x = 4-416. It 
is left to the reader to show that the corresponding minimum value is 
given by y = 2. 


148 NUMERICAL INTEGRATION 

This is the process of forming a definite integral of y from data given 

by a set of tabulated values of x and y. The process is frequently called 

integration by quadrature. The data may be experimental, in which case 

the actual relationship y = f(x) is unknown but the integral f f(x) dx 

is required over some range of argument within the range of given data. 
b 


Frequently however it is necessary to compute the integral | f(x) dx 


a 
for a known function f(x) for which the methods of integration dis- 
cussed in Chapter 6 do not apply. In this case the values of f(x) are 
tabulated at intervals of the argument x within the range (a, δ) so as to 
form the required data. 
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An interpolating polynomial is now used to replace the function f(x), 
and the approximation obtained by term by term integration. Instead 
of using one polynomial to replace f(x) in the whole range (a, δ) the 
usual procedure is to write 


[rea = [sera t [poo ar. + [fe as 


where a <a, <ag<...<a,< band then to replace f(x) of each 
sub-range by interpolation polynomials of the same kind. The error in 
the approximation depends on how closely the interpolation poly- 
nomials approximate the function f(x). Since integration is a summation 
process it is reasonable to suppose that errors in the original data, some 
of which may be positive and some negative, tend to be smoothed out. 

There are many different kinds of quadrature formulae each based 
on integration of a particular kind of interpolating polynomial. The 
simplest to use and often the most effective are those based on the 
forward Newton-Gregory polynomial. 


14.8.1 Quadrature formulae based on the Newton-Gregory polynomial 


Consider a function y given at n equal intervals of the argument x from 
Xo tO Xo + nh then, since x = xy + uh, dx = hdu, the integration of 
the forward Newton-Gregory formula (27) leads to 


Zo tnh n u u u 
10 u=0 


(") Atyy+.. | du (44) 


The general term of equation (44) involves the integration with respect 
uy. 
to u of (‘). 1.6. 


π 
al μίμ — 1I(u—2)...(u~r+1) du. 

It is possible to integrate such an expression for arbitrary r but for the 
simple quadrature formulae based on equation (44) there is no need for 
such generality. 

Suppose it is required to evaluate the integral of f(x) over an interval 
from x = a to x = ὃ. The first step is to subdivide the interval (a, δ) 
into a number of equal parts. If there are N such parts the argument 
values may be written 


Ω = Xo, X1, Xa. ... Χν-αν XW ΞΞ ὃ 


with x, — x», = handb—a= Nh. 
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The second step is to take the argument (and ordinate) values in 
groups of p at a time, i.e. 
(Xo; My s νι Xp-1)3 (Xp-1 5: «ὃς Xop-1)3 a a8 ( aes Xy) 


and then to apply the result (44) to each group in turn with n = p — 1. 
A different integration formula will result for each different value of p. 
The most useful results are those obtained with p = 2, p = 3, and 
p=i7. 


14.8.2 The trapezoidal rule 


This is the case with p = 2, when the N + 1 argument values are 
grouped in pairs (Xo, xi), (%1, Χο) . - . (Xv-1, Xn) and equation (44) is 
applied with n = 1. The polynomial in the integrand of equation (44) 
is assumed to be of the first degree, all differences after the first being 
taken as zero. Thus 


ty na 
[ γάχτε ἢ [ (Yo  πδργο)μ = ἤ(γο + Ayo) 


= hlyo + ἐσσι — yo)] 
= thio πὰ yu) 


[ VR ys 


ΖΝ 
[ Pie (eee ϑόν 
TN-1 


Hence 


ὃ In 
[va | y dx = th(yo + 2y1 + 2yo - - . .ὄ + 2yy-a + yy). 
a Zo 
(45) 


This is the trapezoidal rule. It is a simple result to apply but is not 
particularly accurate unless ἢ is made extremely small (and N very 
large). Geometrically, the process of integration 


[ ἜΝ ΩΝ 


can be represented as the area under the curve y = f(x) and the trape- 
zoidal rule simply replaces the curve by a set of chords through the 
points (χ,. y-) and (χ,.., ¥r+1) With r =0,1,...,N—1 and then 
approximates to the actual area under the curve by adding together 
the trapezium areas as in Fig. 14.4, 
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14.8.3 Simpson’s rule 


This is the case with p = 3, for which the N + 1 argument values are 
grouped, three at a time, i.e. 


(Xo, X1, X2)3 (Xa, Xa, X4)3- 0, (χν...» Χν-1» Xn) 


It is evident that N, the number of intervals, must be even and the 
number of ordinates must be odd. The integration (44) is applied to 


o” 
” 
Pid 


«(ςςς.ς 


lex 


t 
> 

! 
& 
2 
8 


Sy 


A= αὶ Le +h 


Fig. 14.4 


each group with n = 2. The integrand is assumed to be of the second 
degree all differences after the second being omitted. This leads to 


2 2 
i vadx= al [Yo + uAyo + ἐμίμ — 1)A? yo] du 
Zo a 


= hl2yo + 2Ayo + ZA*yo] 
= hl2yo + 201 — yo) + 42 — 2y1 + yo)] 
= shLyo + 4y, + ye] 

and to 


EZ" h 
[ ) dx τὸ 3 [yo + 4ya + yah 
12 


ἄν h 
[ ydx= 3 [yv-2 + 4yy-1 + yyl- 
2ν-- 
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Adding the results gives 


ὃ h 
[var Flot) +20rt y+. εν  )ν-- α) 


+ 401 +yst... + Yn-1)] (46) 


This simple and reasonably accurate formula is known as Simpson's 
rule. The reader should note the grouping of the terms inside the square 
bracket in equation (46): the sum of the first and last ordinates, twice 
the sum of the even ordinates, four times the sum of remaining odd 
ordinates. 

Geometrically the process consists of passing arcs of second degree 
polynomials, 1.6. parabolas with axes parallel to the y-axis, through 
successive trios of points and evaluating the area under these parabolas 
as an approximation to the area under the actual curve of y = f(x). 


14.8.4 Weddle’s rule 


This is the case of p = 7 (the intermediate cases p = 4, 5, 6 lead to 
formulae that lack simplicity), for which the N + 1 argument values 
are grouped seven at a time, i.e. 

(Xo, χα, Xa + + sy X6)3 (Xs, #907 88 Χ12); τον ( oo Xy)- 


Here N must be an integral multiple of six. The integration (44) is now 
applied to each group with n = 6 so that in (44) all differences after 
the sixth are assumed to be zero. Then, for the first group 


18 6 
[ γάχ--Ξ a{ [vo + uAyo + 4u(u — )Δῆγο 
Zo u=0 


u 
+ gu(u — γὼ — 2)A%yp +. 2 . + (6) A®yo] du 
= h[6yo + 18Ayo + 27A* yo + 24Δϑγο + ΝΈΩΝ 
BA5y, + ies A®yo]. 
Before proceeding with the substitution in terms of the ordinates 
Vos Yi» Yar + + + Ye, the last term is amended to (3A%yp)/10 instead of 
(41 A%y,)/140. This introduces an error of (hA®yo)/140 into the evalua- 
tion, which will be insignificant providing that ἢ and A®y, are small. 


Replacing the differences by their expressions in yo, y1, Ya,. - . Ye leads, 
after some reduction, to 


[ y dx = Fsh(yo + Sy + Ye + 6y3 + ys + 576 + Ye): 
Zo 
Similarly, for the second group of terms 


712 
[ ya = Fh. + Sy_ + Ve + Vo + γιὸ + SVi1 + )1) 
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Repeating for each successive group of terms and adding leads to 


ὃ ΖΝ 
[»4-| ydx 
a Zo 


= +h [= con (47) 
where 
ὃι = 1,5, 1, 6, 1, 5; 2, 5, 1,6,1,5;2,...3...5...,5; 
2, 5, 1, 6, 1, 5, 1. 


This is Weddle’s rule. The coefficients c, are most simply remembered 
in groups, i.e. 
first group: 1, 5, 1, 6, 1, 5 
all interior groups: 2, 5, 1, 6, 1, 5 
last group: 2, 5, 1, 6, 1, 5,1 


The geometrical interpretation is that the portion of the curve 
y =/f(x) through each successive group of seven points is replaced by 
an arc of a sixth degree polynomial through these seven points. The 
rule is more accurate but requires more calculation than does Simpson’s 
rule. 


14.8.5 Quadrature formulae based on other interpolation polynomials 


Further formulae can be produced dependent on the integration of the 
Stirling, Gauss, and Bessel interpolation polynomials. These results, 
while providing rapidly converging quadrature formulae, have the dis- 
advantage of difficulty in practical application. Such formulae are not 
discussed here partly because of this disadvantage and partly because 
they are superseded by more sophisticated techniques which have 
application to computer programs. 


14.8.6 Application of trapezoidal, Simpson’s and Weddle’s rule 
As stated earlier if, as the result of an experiment, data are available 
in the form (x, y;), 7 = 0,1, 2,. . ., N then an approximation to the 
integral of f(x) from xo to xy can be obtained by immediately applying 
the results of the previous sections. If it is required to evaluate the 
integral of some known function then the data must be computed before 
proceeding. In each case the accuracy depends, in geometrical terms, 
on the accuracy with which the areas under the polynomials approxi- 
mate to the areas under the actual curve. An estimate of the error 
involved may be easily found. 

In the case of the trapezoidal rule the principal part of the error in 
each integration is the first term neglected in the integral (44), i.e. the 


error in 
τι 
[ y dx 
to 
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is the first term of 


—h 
or 2 A*yo. 
The total error in the use of the trapezoidal rule is 
—h 
ΤΣ Atv t Att... + Ayes) (48) 
In using Simpson’s rule the actual error in the integral from xp to 
Xq is 
2 
af ((3) Aty, + [ Atyy +. . | du 
Ὁ 
2 u 1 2 
Now, ( du=z 1 u(u— 1)(u — 2) du 
. ho 3! Jo 
1 
= 3 [4—8 + 4] 
=0 


and the principal part of the error is given by 


2 (μ 
4 (ῃ Δέγο du 


h 
which reduces to τ δ0 Δέγο. 


The total error involved in the use of Simpson’s rule is then 
h 
= 90 [A*yo + Atys + ΔΈ 4 +...+ Δένγ- 4]. (49) 


The fact that the error depends only on the fourth order differences 
means that Simpson’s rule will be exact when f(x) is a polynomial of 
the first, second or third degree. 
In applying Weddle’s rule the principal error arises from the omission 
— Aé 
of the term — 140 A®yo. 
The total error is then 


h 
— πχοῖδονο + A®ve ++ AI (50) 


Weddle’s rule will be exact for the integral of f(x) when the latter is a 
polynomial of up to the fifth degree. 
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This chapter is concluded by examples of evaluation of an integral 
by each of the three rules. 
0-6 
Example (i) Estimate the value of i sin x dx using a sub-division of 


it) 
0-1 in the argument, (i) by the trapezoidal rule, (ii) by Simpson’s rule, 
and (iii) by Weddle’s rule. Compare the results with the exact value 
obtained by integration. 


Solution This example is used merely as an illustration of the numerical 
processes; the correct value of the integral may be obtained using tables 
of trigonometric functions, i.e. 


0-6 0:6 
[ sin x dx = [τους x| 
0 0 


= 1 — οο8 0.6 
= 1 — 0-825 34 
= 0174 66 to five decimal places. 


To find estimates by the numerical processes, a table of values is 
required. This is given below to five decimal places. 
x 0 0-1 0-2 0:3 0.4 0.5 0-6 
sin x | 0-000 00 0-099 83 0-198 67 0-295 52 0-389 42 0-479 43 0-564 64 
(i) Evaluating by the trapezoidal rule (45) of section 14.8.2 with N = 6, 
h = ΟἹ, gives an approximate value of 
$(0:1)[0-564 64 + 2(1:462 87)] = 4(0-1)(3-490 38) 
= 0-174 52 to five decimal places. 
(ii) Evaluating by Simpson’s rule (46) of section 14.8.3 with N = 6, 
h = 0:1 gives an approximate value of 
4$(0-1)[0-564 64 + 2(0-588 09) + 4(0-874 78)] = 4(0-1)(5-239 94) 
= 0-174 665 
or 0:174 67 to five decimal places. 
(iti) Evaluating by Weddle’s rule (47) of section 14.8.4 with N = 6, 
h = 0:1 gives an approximation of 
#5(0-1)[0-0 + 0-499 15 - 0-198 67 + 1-773 12 
+ 0.389 42 + 2-397 15 + 0-564 64] = 95(0-1)(5-822 15) 
= 0-174 664 
or 0-174 66 to five decimal places. 
The first two results are in error by approximately 0-08 per cent, 0-006 


per cent. The evaluation by Weddle’s rule is correct to five decimal 
places. 
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1.2 
Example (ii) Evaluate the integral [ exp (—x*) dx using a sub- 
Ὁ 


division of 0-2 in the argument and estimate the error involved. 


Solution This integral appears in statistical analysis and cannot be 
evaluated by using the standard methods of integration. Form the 
difference table for y = exp (—x?) from Xo = 0 to xy = 1:2 at equal 
intervals of h = 0-2. 


x y Ay A*y Δὸν Aty Δὸν Δὸν 
0 1-000 0 
0-039 2 
02 0:9608 —0-069 5 
—0-108 7 0-023 8 
0-4 0-852 1 —0-045 7 0-005 9 
—0-154 4 0-029 7 —0-008 6 
0-6 06977 —0-0160 —0-002 7 +0-001 7 
—0:170 4 0-027 0 —0-006 9 
08 0-5273 +0-0110 —0-009 6 
—0-159 4 0-017 4 
10 0:3679 +0-028 4 
—0-1310 
120-2369 


(i) The trapezoidal rule 
From equation (45) of section 14.8.2 with h = 0-2, N = 6, the approxi- 
mate value of the integral 


1-2 
[ exp (—x?) dx 
0 


is given by 
0:1[1-236 9 + 2(3-405 8)] = 0-804 85. 
(ii) Simpson’s rule 
From equation (46) of section 14.8.3 with h = 0-2 and N= 6 the 
approximation to the integral is 


4 (0-2)[1-236 9 + 2(0-852 1 + 0-527 3) 
+ 4(0-960 8 + 0-697 7 + 0-367 9) 
= 4 (0:2)[12-101 3] 
= 0-806 7(54). 


(iii) Weddle’s rule 
From equation (47) of section 14.8.4 with A = 0-2 and N = 6 the 
approximation to the integral is given by 
0:06(1-000 0 + 4-804 0 + 0-852 1 + 4-186 2 
+ 0-527 3 + 1-839 5 + 0-236 9) = 0-806 7(60). 
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An estimation of the error involved in each case may be obtained 
from the equations (48), (49), and (50) and the difference table. From 
(48), the estimation of the error in the trapezoidal rule is 


—(0-2)(—0-091 8)/12 = +0-001 5(3) 


and 0-806 4 should be a better result than 0-804 8. 
From (49), the estimation of the error in Simpson’s rule is 


—(0-2)(—0-006 4)/90 = +0-000 0(14) 


which is zero to four places of decimals. Since round-off errors in the 
entries are propagated into the higher differences the result estimated by 
Simpson’s rule is probably between 0-806 7 and 0-806 8. 

Similarly from (50) the estimated error in Weddle’s rule is 


—(0-2)(0-001 7)/140 = —0-000 0(02) 


and the result cannot be improved (to four decimal places). 

From a practical point of view the most reliable method of improving 
an estimation is to sub-divide the interval of the argument and recalcu- 
late the approximation. An examination of the two results will immedi- 
ately show how many decimal places may be claimed with accuracy. 
If in the present example the values of exp (—x?) are calculated to four 
decimal places at the values of x = 0:1, 0-3, . . ., 1-1, inserted in the 
table and Simpson’s rule used with 13 ordinates, the estimated value 
is found to be 0-806 7(36). The conclusion would be that, with the data 
given, the integral has the value 0-806 7 to four decimal places. 

This integral can be evaluated by other means and its value is 
0-806 74 correct to five decimal places. Any difference between the 
estimated value using Simpson’s rule (or Weddle’s rule) and the actual 
value would be due entirely to round-off errors in the column of entries. 


14.9 GENERAL REMARKS 


In any numerical work it is worth noting that to avoid introducing 
further errors the intermediate calculations are performed using as 
many decimal places as possible, or convenient, but that the result of 
all the calculations is rounded off to no more than the number of 
decimal places in the given data. If interpolation is required in a table 
given to four decimal places, the calculations are performed using at 
least five decimal places and the result rounded off to four decimal 
places. It is impossible to claim a result correct to more decimal places 
than are given in the original information. 

The original data frequently have round-off errors present and as in 
section 14.2.1 these errors are magnified in the higher differences. The 
result of a calculation may therefore have an error in the final decimal 
place claimed. Refer also to sections 13.5 and 13.6. 

The calculations in this, and the next, chapter have been performed 
using a desk calculator and, as the reader will find on carrying out these 
or similar calculations, the mistakes that will occur are due to the 
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operator and not the machine. The commonest mistakes made are tc 
enter the wrong number in the machine, e.g. 8432 instead of 8342, or 
to fail to account correctly for a decimal point. Such mistakes can only 
be avoided by checking each entry made and formulating a definite 
routine for each calculation. For a final check on the work it may be 
necessary to recalculate completely or, preferably, have someone else 
repeat the calculations. 

Familiarity with a machine is the best aid to correct calculations and 
this can only come with practice. 


EXERCISE 14 
1 There is a cubic relationship between x and y given in the following 
table. Show that there are two errors in the table, locate them and 
correct the table. 


x 0 1 2 3 4 5 
y | 97-336 132-651 175-616 226-981 287-496 357-911 


x 6 7 8 9 10 11 
y | 438:977 531-441 636-046 753-571 884-736 1030-301 

2 Form a difference table from the following data and deduce that 
J(x) is a quadratic form. Use the table to find the value of f(0). 
Use the Newton-Gregory forward interpolation formula based on 
x = 0 to find the quadratic function f(x). 


x 2 3 4 5 6 
70) 3-00 4-28 5:88 780 10-04 
3 Use the backward Newton-Gregory interpolation formula to find 


the cubic function of x which takes the values 2, 11, 32, 71, when x 
has the values —6, —4, —2, 0 respectively. 


4 From the following table use an appropriate interpolation formula 
to estimate /(16-4) and f(23-5) 
x 16 18 20 22 24 
ΚΑ) | 2613 293-7 3300 3722 422:3 
5 The following is an extract from a table of sinh x. Use the forward 
Newton-Gregory interpolation formula based on x = 4-60 to esti- 
mate the value of sinh 4-612. Also evaluate using the Stirling 
formula based on x = 4.61. Compare and comment on the results. 
4-60 4-61 4-62 4-63 
49-737 50-237 50-742 51-252 
6 Use an appropriate interpolation formula to estimate the values of 
£(0°3), f(2'55), f(1-75), and f(1-55) given the following data: 
x 0 0:5 1-0 1:5 20 2:5 30 
70) O 04804 08658 1-1317 1:3018 1-4070 1.471]3 


x 
sinh x 
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7 Show that the forward and backward Gauss formulae 


+1 
Y= yor (ῃ Ayo + (;) AYy_. + ie ) ay. τ. ᾿ 


-- 1 
) Ξι Ὁ (ἢ ) Avo + (5) A*yo + (5) Δὅϑν.. +... 


are identical if they are truncated by neglecting the fourth and higher 
differences. 

Deduce Bessel’s formula and use it to find cos-! 0-7 to the nearest 
minute given that 


cos 30° = 0-866 0, cos 40° = 0.766 0, 
cos 50° = 0:642 8, cos 60° = 0-500 0. 
8 When x takes the values 12, 22, 32, 42, 52, and 62 the corresponding 


values of f(x) are 62:14608, 62:34411, 62:53829, 62-728 77, 
62-915 69, and 63-099 18. Evaluate approximately f’(14) and χα. 


9 The function f(x) has a maximum value for 3 < x < 4. Given the 
following difference table use Bessel’s formula to find the maximum 
value of f(x). 


x 7) Δ A? A’ A‘ 
a το - - -. τ 0 τ πες 
3 1.219 710 —0-137 020 —0-024 872 
—0:014 934 0062 828 
4 1-204 776 —0-074 192 —0:023 366 


10 The following is a table of f(x) = x?/(1 + x*) from x =0 to 
x = 1-2 at intervals of 0-1. Find the approximate value of 


1.2 x? 
[ Tox ™ 


using (i) the trapezoidal rule, (ii) Simpson’s rule, and (iii) Weddle’s 
rule. In each case estimate the error involved. 


x fe) | x fe) 
0 0-000 000 06 0-318 697 


0-1 0-009 999 0:7 0-395 129 
0-2 0-039 936 08 0-454 030 
0:3 0:089 277 0-9 0-489 101 
0-4 0-156 006 1:0 0-500 000 
0-5 0-235 294 1-1 0-491 051 

1:2 0-468 506 


15 
Statistics I 


15.0 INTRODUCTION 
The study of statistical methods divides into two parts, descriptive 
statistics which consists in the collection, presentation, and summarizing 


of data and statistical inference which is concerned. with drawing con- 
clusions about the source of data. 


15.1 STATISTICAL EXPERIMENTS 
In the design of an experiment, two main aims are required. These are 
(i) that it is possible to judge the validity of the conclusions, and (ii) that 
the experiment is efficient in that conclusions may be drawn on as few 
observations as possible. 

A statistical experiment must be capable of being repeated, or be such 
that it can be considered as being repetitive. 


Example (i) An unbiased coin tossed 12 times may produce the result 
HTTTHTAHTTHTT 


or 87, 4H where T stands for ‘tails’ and H for ‘heads’. The experiment 
of tossing a coin 12 times can be repeated say 6 times to give results as 
follows: 


T 8 10 11 9 10 9 
Η 4 2 1 3 2 3 


This experiment shows random variation from one experiment to the 
next. 


Example (ii) In an agricultural experiment two types A and B of wheat 
are grown on ten plots with the following yield in kg/m?: 


Al 0-29 0-27 0-28 0-27 0:36 0:30 0-28 0-28 0:33 0:29 
B| 0:30 031 0-28 0-29 0:27 0-30 0-32 0-30 0:27 0-28 


This experiment is thus repetitive and shows random variation. 


A typical statistical problem occurs in mass production where it is 
necessary to maintain a given standard. For example, a machine is to 
produce ball-bearings of 2 mm diameter, and from a set of fifty observa- 
tions it is to be decided whether or not the standard is being maintained. 
It is totally impracticable as well as expensive to measure the whole 
output of the machine. The experiment is repetitive in that many sets 
of fifty observations may be made. Assuming that the machine is set 


STATISTICS I 395 


correctly, the measurements from each set of fifty observations will 
show random variation. 


15.1.1 Definitions 


A population is any set of objects about which numerical information 
is required. 


An individual is any member of the population. 
A sample is any set of individuals. 


An attribute is any property which any individual may or may not 
possess and about which information is required. 


Example (i) All the results which would be obtained in an infinite 
number of tosses of a coin is a population. Twelve such tosses constitute 
a sample from this population and each result, H or T, is an individual 
from the population. The attribute considered could be that a head is 
tossed, and the number of times this happened would be counted. It 
would be expected that in the long run, there would be 50 per cent heads 
and 50 per cent tails. 


Example (ii) The heights of all men in England is a population of 
heights. The attribute under consideration might be the average hei ght, 
or the maximum or minimum height. A sample from this population 
might be the heights of 100 men chosen at random, and the statistical 
problem to infer properties of the population from the sample. 


15.2 PROBABILITY 
The whole of statistical methods depend upon probability theory. 


15.2.1 Definitions 


An experiment or trial is the establishing of certain conditions which 
may produce one of several results or cases. 


Example (i) toss a coin, (ii) toss a coin twice, (iii) toss two coins, 
(iv) measure the diameter of a ball-bearing. 


An event is a possible outcome of an experiment. 


A random variable is a variable which specifies the result of an experi- 
ment. For example in the coin-tossing experiment, the random variable 
x may be defined as x = 0 if a head is thrown, x = 1 if a tail is thrown. 


Mutually exclusive events: Two events E, and E, are said to be mutually 
exclusive if E, precludes E,. More generally, £,, ΕΩ, Es,..., E, are 
mutually exclusive if all pairs (E,, E,), i # j are mutually exclusive. 
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For example in the experiment of tossing two coins, 


(i) let Εἰ be the event H, T, and let Ἐς be the event H, H, then Εἰ and 
E, are mutually exclusive. 


(ii) let E, be the event H and 7, i.e. HT and TH. Let E, be the event H, 
then £, and E, are not mutually exclusive. 


The event either E, or E, is the event that either Εἰ or EZ, or both, 
happens. 

The event both E, and E, is the event that both Εἰ and EF, happen. 

For example, if the experiment is to throw a six-sided die, and Εἰ 
the event that the score is odd, Ες the event that the score is a multiple 
of 3. Then the event (either E, or E,) is the event that the score is 
1, 3, 5 or 6. The event (both E, and E,) is the event that the score is 3. 


Mutually exhaustive events: The events Εἰ and E, are said to be mutually 
exhaustive if either £, or E, must happen. For example, in the experi- 
ment of rolling a single die, E,, the event that the score is even, and E, 
the event that the score is odd, are mutually exhaustive. More generally, 
Ey, Ey, Eg, .. . E, is a mutually exhaustive set if one of the events 
E,, E,, E3, . . . or E, must happen. 


An elementary event is an event which cannot be decomposed into 
simpler events. For example the event E that the score is 5 on rolling 
a die is an elementary event. The event Ε that the score is a multiple of 
3 can be decomposed into the elementary events £,, that the score is 3 
and E£,, that the score is 6. 


15.2.2 Empirical definition of probability 


It is not usually possible to say that the outcome of an experiment will 
be E, but it may be possible to assign a probability to E. Suppose the 
event Εἰ occurs m times in m repetitions of an experiment. Then if as 
n—> oo the ratio m/n -- a limit p, this limit is called the probability of 
the event Ε. This is the empirical or statistical definition of probability. 


Example (i) In the experiment of tossing a single coin, let r be the 
number of heads thrown, and be the total number of throws. Then 
the probability of tossing a head with this coin is 


Lim [(Mmumber of heads)/(total number of throws)] 
which may be written in symbols as 
P{H} = Lim (r/n). 
Example (ii) In the experiment on wheat yields of section 15.1, 


P{wheat yield is between 0-28 and 0-31 kg/m? 
= Lim [(number of plots with this yield)/(total number of plots)]. 


STATISTICS I 397 


Properties 

If E is certain, m=n and P{E}=1 (1) 
If Eis impossible, m=0 and P{E}=0 (2) 
Since 0 <m - ἡ, 0O<p<l. (3) 


15.2.3 Addition theorem 
If E, and £, are mutually exclusive events 
P {either E, or Ez} = P{E,} + P{E,}. (4) 


Let an experiment be performed x times, and let E, be obtained 7, 
times and E, be obtained 52 times. Then (either E, or E2) has been 
obtained (7, + mg) times. Hence 


Pfeither £, or E,} = Lim [(n, + n.)/n] 
= Lim [n,/n + n,/n) 
= Lim (7,/n) + Lim (n/n) 
= P{E,} + P{E,}. 
This result is extended as follows. If E,, E., E3,.. ., E, are n mutually 
exclusive events, 
P{E, or E, or E3 or. . . or E,} 
= P{E,} + P{E.} + P(E} +... +P{E} 6 
Corollary 1 If E,, E2, E3,.. ., E, are also mutually exhaustive then 
P{E,} + P{E,} + P{E3} +... + P{E,} = 1 (6) 
Proof The event (E, or E, or ἔξ or . . . or E,) must happen so that 
P{E, or E, or Ez or. . . or E,} = 1 
and combining this with equation (5) gives equation (6). 
Corollary 2 If an experiment must result in one of n mutually exclusive 


events, i.e. they are mutually exclusive and exhaustive, which are all 
equally likely, then 


P{E,} = P{E,} = P{E,3} =. . . = P{E,} = 1/(total number of events) 
= I/n. (7) 
Proof Since the n events are exhaustive, from equation (6) 
P{E,} + P{Es} + P{E3} +... + P{E,} = 1, 
and since they are all equally likely, 
P{E,} = P{E,} = P{E3} =... = P{E,}. 
Hence the result (7). 


Example If x is the score on rolling a die, then 
P{x = lp = P{x = 2 = P{x= 3} =... = Pix = 6) =}. 
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Corollary 3 Let the result of an experiment be one of the finite number 
of mutually exhaustive and elementary events E,, E2, E3,.. ., E,, and 
let all these events be equally likely. Let Ε be the event defined by the 
occurrence of m of these elementary events. Then 
PE} = min, (8) 
since P{E} = P {any m of the events E;,. . ., Ej} 
= sum of probabilities of these m events 
= m/[n. 
number of outcomes favourable to E 
total number of outcomes 


Thus P{E} = (9) 


and this is the result used in calculating probabilities. 
Example (i) If a single coin is tossed, the probability of getting a head, 
ie. P{H} = ἃ. 
Example (ii) In one throw of a die, 
P {score is 5} = 4; 
P {score is even} = 3/6 = 3. 


Example (iii) Find the probability that the score is 8 if two like dice 
are thrown. 


Solution A total score of 8 may be obtained from the pairs (2, 6), (3, 5), 
(4, 4), (5, 3), (6, 2). The total number of possible events is 36 and there 
are 5 outcomes favourable to the event that the score is 8. 


Die 1 
Table 15.1 
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Table 15.1 shows the representation of each outcome by the pair 
(r, s), r being the score on the first die, s the score on the second. 
Hence, P {total score is 8} = 5/36. 

Example (iv) What is the probability of selecting a red ball in one 
draw from a box containing 5 red and 3 black balls? 
Solution The total possible number of outcomes is 8, and of these the 
number favourable to the event, drawing a red ball, is 5. Hence, 

P {ball selected at random is red} = 8. 


15.2.4 Use of permutations and combinations 
The number of permutations of n unlike objects taken r at a time is the 
number of ordered arrangements of n objects taken r at a time. This is 
nin --- I(t -- 2). . .(2 — r + 1), which is written 

"P= nl/(n—r)!i, 
where by definition, n! = n(n — 1)(n — 2)... .3.2.land0!=1. 


Example (i) The number of different arrangements of the letters 4, B, 
C, Dis 4! = 24. 
Example (ii) The number of permutations of 2 letters from the 4 letters 
A, B, C, Dis 

‘Po = 41/2! = 12. 

The number of combinations of n different objects taken r at a time 
is the number of ways of selecting r objects from the n different objects, 
without regard to order. The number of arrangements is *P, and each 
arrangement can be permuted in r! ways. Hence the number of selec- 


tions is 
"PJ! Ξε αἰ (νοι --- γ)! 


sided F n 
which is written "C, or (") 


The number of ways of dividing πὶ different objects into three groups, 
the first containing n,, the second ng, and the third ng objects, where 
πὶ + ng + ng = Ἢ, is n!/(n,! ng! ng!). This is proved as follows. The 

: ᾿ n ae 
first group is selected in fb ways, and once this is fixed there are 


1 
(nm — n,) objects from which to select mz, and this can be done in 


(" a Ἢ ways. The third group is now fixed. Hence the total number 
2 


of ways of dividing the n objects is 


") τ ae 1: πὶ (n --- n,)! 
ny Ng ee ps Wn — 1)! παῖσι — my — ng)! 


ni 


= —_——_ since n — n, — ng = Ng. 
11! πα! Ne! a pee 
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Extending this argument, the number of ways of dividing n objects into 
k groups, the first containing m,, the second nm, and so on, objects, so 
thatn, tng tng+...+m=nis 


n! 
Ny! Mg! ngl. . «πρὶ 


Example (i) A box contains 5 red and 3 blue balls. Find the probability 
of the event Ε that two balls selected at random are both red. 


Solution The total number of possible outcomes is the number of ways 
(3) . The number of outcomes favourable 
to Ε is the number of ways of selecting 2 red from 5 red balls which is 


(ἢ, thu, 
κῶς 6) τα 


Example (i) A box contains 2 red, 4 blue, and 5 white balls. Find the 
probability of the event E, that 4 balls selected at random contain 
exactly 2 white balls. 


of selecting 2 from 8, which is 


Solution The total number of possible outcomes is the number of ways 
11 

of selecting 4 from 11, which is ( a! The number of outcomes favour- 

able to the event Ε is the number of ways of selecting exactly 2 white 


6 
from 5 white and 2 others from the remaining 6 others. This is (3) ( ) 


ne β) 0 0} τοι 


15.2.5 Use of the complementary event 


Hence 


The events Εὶ and £ are complementary events if £ occurs whenever E 
does not. Thus Ε and £ are both mutually exclusive and exhaustive. 
Hence 


P{E} = 1 — P{E}. (10) 
If P{E} = p and P{£} = q, then 
pt+q=1 or q=l—p. 61) 


Example (i) A box contains 2 red, 4 blue, and 5 white balls. Find the 
probability that 4 balls selected at random contain at least one white 
ball. 
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Solution Let E be the event that at least one of the four selected is 
white, then Ε is the event that none of the four is white. Now 


μας ()/ (2) = 


and hence P{E} = 21/22. 


Example (ii) In an experiment in which a coin is spun 12 times, find 
the probability that at least 2 heads will be thrown. 


Solution Let E be the event of obtaining at least 2 heads. Then the 
probability of a head (H) in one throw is 3. The number of ways of 


‘ 12 3 
obtaining exactly 2 heads and 10 tails is e ) and the probability of 


τς, {12 (1Ὰλ3 (1\2° 
wisis [7} (5) (5) 
There are two methods of obtaining P{E}. 
Method (a) The probability of the event E is the probability of obtain- 
ing 2, 3,4,.. ., 12 heads. Hence 
P{E} = P{2H or 3H or. . . or 12H} 
= P{2H}+ P{3H}+...+ P{12H} 
" " ἫΝ Ἵ 1\3/1\9 a )" ;): 
Ξ.2)3) 8) τ13}.2) ἃ) +s Fla) a) ap 
Method (b) E is the event of obtaining at least 2 heads, so that the 
complementary event £ is the event of obtaining no head or one head. 


n= (3) +(7)G) 6) - 5 6)" 
and P{E} which is | — P{E} is given by 
P{E}=1— 13(5) - 


15.2.6 Conditional probability 
The probability that the event E, will occur knowing that the event Ey 
has already occurred, or is certain to occur, is called the conditional 
probability of E2, given E,. This is written P{E2| Fy}. 

Suppose Εἰ and E, are outcomes of an experiment which is repeated 
n times. Let E, occur n, times, and let the number of times both E£, 
and E, occur be mg. Then the probability of EZ. given that Εἰ is certain 


= Lim (72/1) 


: Ng [ἢ 
= tim (73/3) 
= Lim (a,/n)/Lim (1,/n) 
= P{both E, and E,}/P{F}}. 
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Hence the Multiplication theorem follows in the form 


P{both Εἰ and Ey} = P{E,}P{E,| Ej}. (12) 
Two events E, and Ἐς are said to be independent if 
P{both Εἰ and Ἐς) = P{E,}P{E,} (13) 


Le. if P{E,| Ey} = P{E,}. 
The definition of independent events is extended to the k events 
Ey, Eo, Es, . . ., Ex, which are independent if 


P{E, and E, and. . . and E,} = P{E,}P{E.}. . .P{E,} (14) 
Example (i) Two coins are tossed. What is the probability that the first 
coin shows a tail and the second a head? 


Solution Let E, be the event tail on the first coin and E, be the event 
head on the second coin. Then since all possible outcomes may be 
written 


HH, HT, TH, TT, 
the required probability 
Pi{both Ε, and E,} = } =} x 4 = P{E}}P{E}. 
Hence Εἰ and E, are independent events. 


Example (ii) Two dice are rolled. What is the probability that the total 
score is eight if it is known that the dice show at least three points each? 


Solution Let E, be the event that the dice show at least 3 points each, 
E, the event that the total score is 8, then, using Table 15.1, 

_ 336 3 

~ 16/36 16 
Example (iti) Three dice are rolled. If no two dice show the same face, 
what is the probability that at least one face shows a six? 


Solution Let E, be the event that no two faces show the same score, 
Ἐς the event that at least one face shows a six, then £, is the event that 
no face shows a six. Then 

P{E2|Ey} ΞΞῚ - P{E2|E,}. 
Since there are 3! ways of arranging sets of three, 


P{E,} = (3) (3) 3!= (6.5. 4)/63; 


or alternatively, P{E;} = P{\st is any number and 
2nd is a different number and 
3rd is different from 1st two} 
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5 4 4.3 
Also, P{both £, and E,} = ae 


6667 δ 
Hence, P{E,|E,} = P{both FE, and E,}/P{E,} 
= (5.4.3)(6.5.4) =}. 
Hence P{E,| Ey} = 4. 


153 RANDOM VARIABLES 


Consider an experiment A which may have several outcomes. It is 
convenient to introduce a variable x to correspond with each event of 
A. For example if a coin is spun the random variable may be defined by 
x = 0 if the event is heads, x = 1 if the event is tails. If two coins are 
spun and the number of heads (H) counted, the random variable may 
be defined as x = O if no H, x = lif 1 H (HT or TH), and x = 2if 
2H. In the experiment of wheat yields of section 15.1, x may be defined 
as the yield in kg/m?, in the experiment of measuring the diameter 
of ball-bearings x may be defined as the diameter in mm. The random 
variable x is defined as a number associated with each event of an 
experiment, random since its value depends on the outcome of an 
experiment and hence on chance. 


15.3.1 Discrete random variables 


A discrete random variable is a random variable that can take only a 
finite number of distinct values which can be arranged in a definite 
order. 


Let the random variable x take values x,, ΧΩ, X3, - - +» Xn Corre- 
sponding to all the 7 events of an experiment. 
Let Pi = P{x = x3} 
pPa=P {x = x2} 
Ps = P{x = x} 
pPrz Ρ {x = Xn} 


Then the set of numbers p,;, i= 1, 2,3,...,”, defines a frequency 
function f(x) which gives the probability that the random variable x will 
assume any value in its range. A frequency function is often given as a 
table of values. Thus, if a single die is rolled and x is defined as the 
score, the table of values is given by Table 15.2 


x 1 2 3 4 5 6 
f@) | 16 1/6 1/6 1/6 1/6 1/6 


Table 15.2 
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It is sometimes convenient to graph the frequency function f(x) by a 
line graph, as in Fig. 15.1. 


Fig. 15.1 


If two dice are rolled, let x be the total score, then Table 15.3 is obtained 
from Table 15.1. 


x 2 3 4 5 6 7 8 9 10 11 12 
f(x) | 1/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36 
Table 15.3 

and the line graph is shown in Fig. 15.2. 


f(a) 


Fig. 15.2 


Since fo) =P{&=x}=p, i=1,2,3,...,n, 


and there are exactly n events associated with the experiment, 
n 
> P 4 Ι, 
ἐξὶὶ 


or Σ fx) = 1. (16) 
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The frequency function determines the distribution of any event 
associated with the experiment. For example 


Ρίχ, <x -- χρῷ τ Ρίχ τὸ χῇ - Ρίχ Ξξῷ ὡαχλ,ρεὰ}τὲ ... +P =x} 
= f(x) τ fGra1) +. . +S) (17) 


An important function associated with the frequency function is the 
distribution function F(x), sometimes called the cumulative distribution 
function, which is defined by 


F(x) = 2 fo. (18) 


where the summation is taken over all those values of x less than or 
equal to a given value. Hence 


F(x;) = = fO 
= f(xs) + fla) + f(%s) +... +f); 


or F(x) = Pi + Po + Ps + Pree er + Pio (19) 


i.e. F(x;) gives the probability that the random variable x is less than 
or equal to the given value x,, whereas f(x,) is the probability that the 
random variable x is equal to x,. Hence 


Pla <x <b} = F(b) — Fa). 


The graph of F(x) for the frequency function of Table 15.3 is shown in 
Fig. 15.3. 


12345 6 7 8 9 10 11 12 13 14 


Fig. 15.3 


15.3.2 Continuous random variables 


If a random variable is capable of taking any value in a given interval 
or intervals it is called a continuous random variable. Thus the random 
variable x defined as the height of a man from the population of the 
heights of all men in a given place is a continuous random variable, 
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since it can take any value in an interval say from 1-5 metre to 1-8 
metre. Variables which involve measurement such as height, length, 
temperature, and weight are continuous variables. 

As an example consider the set of numbers x representing the 
diameter in mm of 200 ball-bearings, the diameters being measured to 
three places of decimals. Values of x are thus regarded as values of a 
continuous variable whose values have been rounded off to three 
decimal places, so that a measurement of 0-155 mm means that the 
actual measurement is between 0-1545 mm and 0-1555 mm. The fre- 
quency or number of times the value x; occurs is denoted by ἤ,, so that 
ΣᾺ = 200 and the resulting frequency distribution is shown in table 
1 


form in Table 15.4. 


x 0-115 0-125 0-135 0.145 0-155 0-165 0.175 0-185 0-195 0:205 0-215 
f 6 9 11 21 31 39 32 25 12 10 4 


Table 15.4 


The frequency here denotes the absolute frequency with which x occurs, 
and this table is the type of result usually obtained in an observed 
frequency distribution. The word frequency more often denotes the 
relative frequency, which is the ratio of absolute frequency to the total 
number of observations, since relative frequency approaches probability 
as the number of observations increases indefinitely. 


40 


Fig. 15.4 


The results of Table 15.4 are represented graphically by the histogram 
of Fig. 15.4 in which areas represent absolute frequencies: for example, 
the area of the rectangle the centre of whose base is 0-115 represents 
frequency 6. The total area of the histogram is thus 200. The length of 
the interval on the x-axis, here given by 0-01, is usually denoted by ἢ 
and called the class-interval, 
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If the measurement of 200 ball-bearings is considered as 200 runs of 
an experiment, a further 200 runs would result in a histogram twice the 
area of that in Fig. 15.4. In order to compare histograms based on 
different numbers of experiments, it is more convenient to choose the 
scale on the ordinate-axis to represent relative frequency f{/n so that 
the total area is 1. With this choice of unit the histogram approaches a 
fixed histogram as n~> oo. Also if the measurements x can be made 
as accurately “as we please’, the class interval ἢ can be chosen as ‘small 
as we please’ so that the histogram approaches a smooth curve as 
h—»0 and ἡ -- oo. If this curve can be represented by f(x), this is the 
frequency function of the continuous random variable x. For a con- 
tinuous random variable the frequency function is sometimes called the 
probability density function. 

The frequency function for the continuous random variable x is 
defined as the function f(x) which has the following properties: 


Ὁ 70)50, (20) 
(compare with ρὲ >0_ all i). 
(ii) [ f(x dx =1; (21) 


(compare with > p; = 1). 
i 


ὃ 
(iii) Ϊ Ι)αχ-ερία -χ <b} (22) 
a 
where a, b are any two values of x; (compare with P{x, <x < x,} 
s 
= 2 70. 
The distribution function F(x) is defined by ἔα) τί > f(t), the 
t<2z 


summation now being the limit of a sum and hence an integral, or 


Fay= [fod (23) 
and S(x) = ΕΟ). (24) 


The value of statistical methods is in finding mathematical models to 
fit as closely as possible the observed or empirical data. There are 
standard frequency functions which are used most often as models, and 
some of these are demonstrated in the following examples. 


Example (i) Consider the possibility of using 


λε. *# x>0 
fo) = {9 x<0 


as a frequency function for x, where 4 is a positive constant. 
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Solution Checking the properties (20) and (21) gives 


(a) Ae-**7 >0 all x; 
(b) | Κα dx - Ae~ ** dx 
-ω Ὁ 
[1 
ο 
=1 foralla, 


The distribution function F(x) is given by 


Fay= [fod 


floc) = λε λα 


Fig. 15.5 Fig. 15.6 


The graphs of f(x) and F(x) are shown in Figs. 15.5 and 15.6, and 


δ 
ρα -α«)}-- [ΓΚ ax 


= (δ) -- Κῶ 
=I-—e”—1+e-% 
= e744 — eA. 


Although the choice of f(x) as a negative exponential appears to be 
arbitrary, it is found that this function can be used as a good approxi- 
mation to many observed frequency distributions. 


Example (11) A uniform, or rectangular distribution, is of use when the 
continuous random variable x is equally likely to take all values between 
two given values a and b. Consider the possibility of using f(x) = ς, 
where c is a constant, as the frequency function for x. 
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ὃ 
Solution Since, from equation (21), I Ss) dx = 1, then ὁ = 1/(6 — a). 
a 


Hence the frequency function becomes 


1 
{GV =1b= ἃ fora<x<b, 
0 for all other values of x. 
The distribution function F(x) is given by 


Fa)= [πο = α -- al — ὦ, 


and the functions f(x), F(x) are sketched in Figs. 15.7 and 15.8. 
f 


ο α δ ax 
Fig. 15.7 Fig. 15.8 


Example (iii) The life x, in hours, of a certain type of electronic valve 
has the frequency function 

J(x) = 100/x?, x > 100; 

ff) =0, x < 100. 
(a) Find the probability that the life of a valve is less than 150 hours. 


(b) If the valve is still working after 150 hours, find the probability that 
it will have to be replaced before 250 hours life. 


Solution 
150 
(a) P{x < 150} = [ (x) dx 
150 100 
=0-+4 aT dx 


(Ὁ) Let £, be the event x > 150 and let E, be the event x < 250. Then 
P{E,|E,} = P{E, and E}/P{E,} 
= P{E, and E,}/{1 — P{£,}]. 


Now, from (a), P{E,} = 3. 
250 1 4 
Also P{E, and E,} = [ sa dx == 
iso * 15 


Hence P{E,|£,} = ὃ, and this is the probability that a valve, if it lasts 
more than 150 hours, will cease to function before 250 hours of its life. 
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15.4 REDUCTION OF DATA FOR AN OBSERVED 
FREQUENCY DISTRIBUTION 


A mass of raw data needs to be ordered if it is to be easily read and 
understood. Measures of location and dispersion are used to describe 
the distribution in few and simple parameters. 


15.4.1 Measures of location 


(a) The mean or arithmetic average is the most important measure of 
location. For a discrete random variable, suppose there are f, values 
of x =x, i= 1,2,3,...,k, then the mean m or is defined by 


(SA x,) [ι, where ΣΛ =n. 


Example Find the mean of the distribution given by the following 
table: 


x 0 1 2 3 4 5 
f 14 15 23 22 14 12 
Solution df = 100 =7, 
xXorm 
=(0 x 1441x1542 x 234+3x 224+4x 1445 x 12)/100 
= 2.43. 


For a continuous random variable, as in Table 15.4 of section 15.3.2 
it is necessary to assume that approximately all the observations in a 
given class interval have the value at the mid-point of that interval. If 
x, is the mid-point of the ith interval, and f, is the corresponding class 
frequency, then as before: 


X orm = (> f,x,)/n. (25) 


The mean of the distribution of Table 15.4 is 0-165 3. 

(b) Suppose there are observations which can be arranged in order, 
then the median is the value of the middle observation. If n = 2k + 1 
(odd) the median is the value of the (k + 1)th observation. If n = 2k 
(even) the median is the value of the average of the Ath and (k + I)th 
observations. The median is of use for example, in describing the wage 
level in a certain industry where a few very high wages affect the mean 
but not the median. The median wage is such that half the employees 
receive at least this much. 

(c) The mode is the observation which has maximum frequency. For 
example for the set of measurements 4, 4, 5, 5, 5, 6, 7, 7, 8 the mode 
is 5. A single mode gives a rough and quick measure of location but 
is not very useful, particularly if the histogram has more than one peak. 
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15.4.2 Measures of dispersion 
These are measures of the spread of the observations. 


(a) The range is the difference between the largest and smallest measure- 
ments in a set of measurements. It is used in quality control. 

(b) The mean deviation is the average of the deviations of each observa- 
tion from the mean, all deviations being taken as positive, i.e. 


1 π 
mean deviation = a > [x — μὴ 
i=1 


1 * 
or ᾿ > [x1 -- fi 
f=] 


(c) The variance and standard deviation, which is the square root of the 
variance, are the most important measures of dispersion. Variance, 
denoted by 52, is defined by 


= 25 (ey — mf, (26) 


and the standard deviation s is the square root of this. 
(4) The interquartile range is defined as Q3 — Q,, where Q, is that 
value such that } of the observations are less than Q,, Q, is the median, 
Ος is such that ξ of the observations are less than Q,. 


15.55 MEASURES OF LOCATION AND DISPERSION 
FOR A PROBABILITY DISTRIBUTION 


15.5.1 The mean 


(a) First, consider the mean of a probability distribution for a discrete 
random variable. Let an experiment be repeated n times, and let there 
be ἢ observations x, i= 1,2,3,...,k. Then the mean of the n 
observations is m, and using the definition of section 15.4.1, 


Now asn — οὐ, f(/n — p, where p, = P{x = x,}, and hence γι -- > p,x,. 
Hence the mean of a probability distribution for a discrete random 
variable is defined as 


B= DP Χο (27) 


(b) Second, consider the mean of a probability distribution for a con- 
tinuous random variable. Let an experiment be repeated n times, and 
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let the observations be grouped into classes of width ἢ = Ax. Let ἢ 
be the number of observations in the ith interval and x; be the mid- 
point of that interval. Then m, the mean of the grouped distribution is 
given by 


Now, as n ~—> οὐ, the value of f,/n approaches the probability that x lies 
in the ith interval, i.e. f/n > f(x,) Ax, where f(x) is the frequency 
function. Hence, as n> οὐ, m—> S{f(x,) Ax}x, If also Ax — 0, then 
m—> Jxf(x) dx. Hence the mean for a continuous random variable is 
defined by 


p= | feo a, (28) 


the integration being over the range of values of x for which f(x) is 
defined. 

The mean for any distribution is thus defined by the first moment of 
the distribution. Compare with formula (12) of section 7.2.3. 


15.5.2 Dispersion, variance 


(a) First consider the variance of a probability distribution for a 
discrete random variable. For observations x, where x, occurs ἢ 
times, i= 1, 2,3,.. .,k, 


1 
3 -- nahi — m)? 


As ἢ. - oo, the value of (f;)/n — p; = P{x = x}. Hence the variance 
of the probability distribution is defined by 


= 2, pix — μ)". (29) 


(b) Second, consider the variance of a probability distribution for a 
continuous random variable. For n observations the variance of the 
grouped distribution is 


52 = Sle — im)? 
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Asn -- ©, the value (/;,)/n —> f(x,) Ax, and if at the same time Ax — 0, 
the variance of a continuous probability distribution is defined by 


σ᾿ Ξε i (x — ἡ") dx. (30) 


The integration is taken over the range of values of x for which f(x) 
is defined. 

The variance of a distribution is defined by the second moment of the 
distribution. Compare with formula (20) of section 7.2.7. 


Example (i) Let x be the score recorded when a die is rolled. Find the 
mean and variance for the probability distribution. 


Solution The variable x is discrete and the following table gives the 


(rectangular) probability distribution. 


a a 2. 3 4. 5. ὁ 
ριξε ἐ ἐ ὁ ἐ 


The mean μ and variance o? are given by 
w= Upx,= 41 4+2434...46=3'5, 
o? = Lp(x, — pw)? 
= 41 — 35)? + (2 — 3-5)? +... + [(6 — 3:5) 
= 35/12. 


Example (ii) Find the mean and variance for the rectangular distribu- 
tion given by 


1 
tym [8 “ax <4, 
= 0, when x is not in the interval (—a, a). 


Solution Using formulae (28) and (30), the mean μ and variance o? are 
given by 


b= [ χ f(x) dx = i 3(x/a) dx = 0, 
= ic (x — 0)?f(x) dx = ξα3. 


Example (iii) Find the mean, variance, and median for the negative 
exponential frequency function f(x), defined by 


Ae~ *, x>0 
I= " x< 0. 
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Solution The mean μ is given by 
= [ xf(x) dx = [ xhe~ 4# dx = 1/A, 
Ὁ «Ὁ 


on integrating by parts. 
The variance is o?, where 


= { (x — 1/A)?Ae~ ** dx 
0 


The median x,, is given by 
lnm 
{ he- ** dx = 3, 
9 


since half the population lies each side of the median. Hence 


το] τὶ 


or 1—e~¥n = fj, 


Solving, Xm = (log 2)/A. 


15.6 PRACTICAL COMPUTATION OF MEAN AND 
VARIANCE 


It is possible to simplify the arithmetic involved in computing the mean 
and variance by coding the data. There are two processes of coding 
used, (a) changing the origin, or using a ‘working zero’ and (b) changing 
the scale, or unit. Changing the origin involves writing y = x — x9 
and changing the scale involves writing u = y/c, where the constant xo 
is the new origin, or working zero, and c is any convenient constant. 


15.6.1 Computation of the mean 


Writing γι = x; — Xo, μι = y,/c, means that x, = cu, + Xo for each i, 
and hence, 


X= οὔ + Xp. (31) 


Example Find the mean of the three numbers 
x = 0-169, 0-162, 0-170. 
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Solution Write y = x — 0-160 = 0-009, 0-002, 0-010, and u = 1 000y 
= 9, 2, 10. In working units the mean is 4(9 + 2 + 10) or 7. Partly 
decoded it is 0-007, and fully decoded the mean is 0-167. Alternatively, 
ὥ = 7, ¢ = 1/1000, x, = 0-160, so that 


x = 0-007 + 0-160 = 0-167. 


15.6.2 Computation of variance 
(a) Changing the origin to x, by writing y = x — Xo gives the result 
52 = 33, (32) 

ie. coding by changing the origin has no effect on the variance. This 
may be proved as follows: 

1 

$= LS — Ὁ} 
1 
ΞΞ esis — Xo — ¥ + Xp)? 


1 
= => fie — 5) 
εἰ 
= s2. 
(b) Changing the scale so that u = y/c gives the result 
su = Sy/c?, (33) 
or, variance of u = (variance of y)/c?. 


This is proved as follows: 


1 
s8= 5D fu — ἃ» 
t 

1 1 μι 

“52h εξ: 
11 : 

= 2S — 3 
1 

Hence 52 = δῇ = c752, (34) 


From the example of section 15.6.1, 

u = 1 000y = 9, 2, 10, 

u= 7, 

52 = 4(2? + 5? + 3?) = 38/3. 
Hence 52 = 38/33 x 105. 
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15.6.3 Modified form of formula for variance 
Because the mean ¥X is not usually a simple number, and even the coded 


mean ἢ may not be simple, the calculation of variance is often simplified 
by using the identity 
Lf(u, — 0)? = x fu? — πῆ. 
This identity is not difficult to prove, for if Σ ἢ, = n, 
Lf(u; — a)? = X fu? — 2uaz + ἢ) 
ΞΕ Σ fui — 22 Σ fu,+ 7? Xf, 
Now (2 f,u,)/n = a, and hence 
Σύ, — i)? = X fu? — 2a(nd) + wn 
= Σ fu? — ni’. 


Thus the modified form of the variance s? is given by 
1 1 & 
s=- > fu, — a? = (; > μα) — πῇ. (35) 
π {Ξξιὶ Miz 
Alternatively, equation (35) may be written 
ΓΕ Ww ee 4 
t= ΣΙ Σ sat —2 (3 seu) | (36) 
ἐπ 1 ἐξ 


Example Find the mean and standard deviation for the following 
distribution. 


x 995 9:96 9-97 9:98 9-99 10-00 10-01 10-02 10-03 10-04 10-05 
f 2 4 5 8 10 16 15 17 11 7 5 
Solution The work is set out in table form as follows, with 
u, = 100(x, — 10-00). 


x f u uf wf = (uf)u 
9-95 2 -5 —10 50 
9-96 4 —4 —16 64 
9:97 5 -3 —15 45 
9-98 8 —2 —16 32 
9-99 10 -1 —10 10 

10-00 16 0 0 0 
10-01 15 1 15 15 
10-02 17 2 34 68 
10-03 1 3 33 99 
1004 vi 4 28 112 
10-05 5 5 25 125 


Totals 100 68 620 
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The calculation proceeds as follows: 
4 = 68/100 = 0-68, hence the mean X is given by 


x = 0-006 8 + 10-00 = 10-006 8. 


Also δὰ = 620/100 — (0-68)? = 6-20 — 0-462 4 = 5-738. 
Hence s? = 0-000 573 8, and the standard deviation is given by 


5: = 0-023 96 = 0024. 


15.7 THE BINOMIAL DISTRIBUTION 


There are three standard theoretical frequency distributions to which 
practical, or observed, distributions may approximate closely. The first 
of these is the binomial distribution. 

Suppose the result of a trial is only success or failure, i.e. a particular 
event occurs or does not occur. Let the probability of success be » then 
q = 1—p is the probability of failure, since success and failure are 
independent and exhaustive events. Let there be n trials and let r be 
the number of successes recorded; and further, suppose that successive 
trials are independent. The problem is to find the frequency distribution 
of r, which can take only integer values. 

The probability of obtaining r successes and (n — r) failures in a 
specified order, say the first r trials result in success, is 

r (n —r) 
rr is 
PsP +P. ΟΡ 4φι4φιφον ον 
which equals pq. 
The number οἵ different ways in which r successes can occur in ἡ trials 
. [n Ss : sata 
18 (a Hence the probability of r successes in any order is (") pq. 


This expression is the (r + 1)th term in the expansion of the binomial 
expression (q + ρ)" and is called the binomial frequency function f(r), i.e. 


fr) = (") pq. (37) 


The symbol r is used to emphasize that r is a discrete variable which 
can take integer values only. Hence 


n 
Pr = 0) = (5) pg" = φ' 
Pir = ἢ} Ξ (ὴ Ρ᾽ 45} = πρᾳ" ", 


Ee = (3) peg’? = {πίη — 1)p?q"-?, 
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and so on. The terms on the right are recognized as successive terms 
of the binomial expansion of (q + p)", since 


q+ Pp) = 4" + ng™ip + ann — Ig"? +... +p (38) 


= 3) ne 


Also, since q + p = er then 
nr 
> fr) =1. (39) 
r=0 
15.7.1 Mean and variance of the binomial distribution 


Mean of r= wp = Σ f(r) 
r=0 


π γ.πὶ 
a 2, nin — pice 
< πὶ TyAn-T 
= 2. στα ΞΡ, 
π np\(n — 1)! _ ee 
“2 caye=a=eame ἃ 
and writing r—1=s, 
m1 (n— 1)! 


8η(π5--1 -- 8) 
mean of r = np >> ss Fee πα ἢ 
= nq + py? 
=np sineeq+p=1. 
Hence, mean of r = np. (40) 


Using similar algebraic manipulations, 
variance of r = o? = Σ rf(r) — p?, (41) 
r=0 
and writing r? = r(r — 1) + r in the first expression on the right, 


a2 Σ {( -- ἜΝ τὰ i 


“Sere See 
= = rir — 1) ce = oP 5 + np (from (40)) 
n (n - 2)! ΡΥ ΟΣ 
ὩΣ Go DG - αν Ὁ ΠΣ: 1)p?p gi )—(r-2)} 


+ np. 
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Writing r — 2 = s in the summation, 


x “ — 2)! 
ΣΟ 2 re erat a — lp? + np 


= n(n — 1)p*(q + ρ)" 5 + np 
= πίη — 1)p? + np. 
Finally, substituting in equation (41), 
o? = n(n — 1)p? + np — np? 


= πρᾷ — p) 
= pq. 
Hence, for the binomial distribution, 
mean = np (42) 
variance o? = npq (43) 


Example (i) A machine makes certain products which are known to be 
on average 1 per cent defective. What is the probability that a box of 
one hundred of these products contains (a) exactly 3 defectives, (b) at 
least 3 defectives? 


Solution Let r be the number of defectives in a box of 100, then r has 
a binomial distribution with n = 100, p=001, ῳ = 0-99, 


(a) Pir = 3} = f(3) 
» (ἢ (001)3(0.99)51 
= 0-060 5 
(Ὁ) Pir > 3} = P{r=30rr=4orr=Sor.. . or r = 100} 
= 1— P{r=0 or 1 or 2} 
= 1-0) — fl) —f@ 


Ξε 1 -- (ἢ (0-99)200 — [1] (0Ο1)(0.99)99 


- ‘ey, (0-01)2(0-99)%8 


= 1 — 0:366 44 — 0-370 13 — 0-185 065 
= 0-078 36 
= 0-078 4. 


In the calculation (0) = (0-99)!°° was calculated using five-figure 
logarithms, f(1) was then calculated as 100f(0)/99 and f(2) as $f(1), ie. 
each term of the binomial distribution was calculated from the previous 
term. 

The mean number of defectives in a box of 100 is np = 1 and the 
variance is npg = 0-99. 
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Example (ii) If the probability that a target will be hit on a single 
bombing run by an aircraft is 0-1, find the probability that on 20 such 
runs the target will be hit at least twice. 


Solution In this example, p = 0-1, 4 = 0-9, and n = 20, so that 
Plr >2} =1— P{r < 2} 

1— f0) — αὶ 

ΞΕ 1 -- (0) (0-1)°(0-9)?° — @ (0-1)(0-9)?9 

1 — 0-121 56 — 0-270 13 

= 0-608 3. 


I 


15.8 THE POISSON DISTRIBUTION 


Suppose r has a binomial distribution f(r) = @ pq" and that the 


number of trials n is very large, then the computation of f(r) becomes 
very lengthy. There are two standard frequency functions which give a 
good approximation to the binomial distribution when n is large. The 
first is the Poisson distribution in which p is also very small so that np 
remains finite. 

To derive the Poisson frequency function, consider n— co and 
p—O0 in such a way that mp = p is a finite constant. Expand the 
binomial coefficient (") and multiply numerator and denominator by 


n’ to give 
r factors 
-ς---------.---.. ---;᾽ὡ--Ψ- -- --- - -.- 
1 πίη - 1) -- 2). . «(ἡ πρὶ 
ἡ ee ΞΟ οἱ ars 


Since 4 = 1 ~— p, mp = p, and q = 1 — u/n, this expression becomes, 
on substituting, 


rom Be-( (0-9 (He (8) 


It can be proved that 


so that, as n> οὐ and p— 0, the value of f(r) approaches (μ΄ e~*)/r!. 
Hence the Poisson distribution is given by 


fir) =F. (44) 
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15.8.1 Mean and variance of the Poisson distribution 
Using definition (27), with x = r; p = f(r), then 


mean of r= Srfir)= Σ eae 


35 FA 


=e #4 5 a ie 
mi(r— 1)! 


prot 
μ 
τὰς ἢ «> Gor 


Writing r — 1 = s in this summation gives 


mean of r = e~ ΣΕ 


= e-4u e# 
τ--- μ. 
Variance of r = σξ = Σ r? f(r) — μϑ (45) 
and > r? f(r) =f > r2e7# Ει 
r=0 r=0 Γ. 


Set γ3 = r(r — 1) +r, so that 

LP f= Σ γί -- NewS + FS κοτε 

r=0 r=0 ry 0 r. 
wn? 0 prt 


soe SA Soe Hee ay, 


Writing r — 2 = 5 in the first summation and r — 1 = ¢ in the second 
summation, 


Srna es penn SH 


6," we 6" + er u e# 
= w+ m. 
Substituting in equation (45) gives 
variance of r = μ. 
Hence for the Poisson distribution, 


mean = variance = μ. (46) 
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15.8.2 The Poisson distribution as an approximation to the binomial 
distribution 

The number of happenings of a rare event in a large number of trials 

approximates to a Poisson distribution. The approximation is fairly 

good when» > 100, p < 0-05. Figures 15.9 and 15.10 show the binomial 

and Poisson distributions when μ = 4, p = 0-4, and p = 0-04 respec- 

tively. 


na 
ο 1 2 3 4 5 6 7 8 9 10 


= 
Oo i @ 6. 4 5» δ᾽ 7 ὃ 5. ἴδ 


Example (i) Suppose that example (i) of section 15.7.1 is evaluated 
using the Poisson distribution instead of the binomial distribution. 


STATISTICS I 423 
Solution Here n = 100, p = 0-01, μ = 1, and 
Sr) = ὁ “ pr! 
Then (a) Pf{r = 3} = 3) 
= e~1(1)9/3! 
0-061 3; 
and (b) Pir > 3} = 1 — Ὁ — fd) -- Ὁ) 
=l—e"W+14+%h 
= 1-0-9197 
== 0-080 3. 


Example (ii) Boxes contain 200 articles known to be on average | per 
cent defective. The manufacturer guarantees that in a box of 200 there 
will be fewer than three defectives. What is the probability that the 
guarantee will be violated? 


Solution The number of defectives in a box of 200 has a distribution 
which approximates to a Poisson distribution since n = 200 is large, 
} = 0-01 is small and np = μ = 2. 

Let r be the number of defectives, then 


SQ) =e"? 2"/r! 
Hence, approximately 
Pir <3} Ξξ- ει Ὁ +2 + 27/2!) 
τ 56:3 
= 0-676 7. 


Exactly, using the binomial distribution and a longer calculation, 


2. (2 
Pr<3}=> ( ) (0-01)'(0-99)200-+ 
r=Q 
= 0-679 78. 


Hence the probability that the number of defectives in a box is three 
or more is 0-323 3, and the probability the guarantee is violated is 
0-323 approximately, or 32:3 per cent. 


15.8.3 The Poisson distribution as an independent distribution 


If events occur at random at a constant rate in time or space, it can be 
shown that the number of events in a given time interval or in a given 
region has a Poisson distribution. The assumption is made that the 
number of events occurring in say, a given time interval is independent 
of the number that occurred in earlier intervals. Consider an example 
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of an observed distribution that can be thought of as having a Poisson 
distribution. Table 15.5 gives the observed distribution of «-particles 
emitted in 6 seconds. The variable x is the number of «-particles 
emitted per 6-second interval and x was observed in 2500 such 
intervals. The mean x for the observed distribution is 3-76. If this is 
taken as an estimate of the mean of a Poisson distribution, u = 3-76, 
then the corresponding frequencies would be expected to be 


2 500 ε 3:18 (3-76)'/r!, r=0,1,2,.. 10. 


The results of computing these values are given in Table 15.5. 


x 0 1 2 3 4 5 6 7 8. 9 310 
observed 
frequency | 71 210 384 515 501 383 231 121 46 23 = 15 
expected 


frequency | 58-2 218-8 411-4 515-7 484-7 364-5 228-4 122-7 56-7 24-1 13-8 
Table 15.5 


There is good agreement here. Had it not been so, the Poisson 
distribution as the mathematical model for this observed distribution 
would not have been acceptable. Other examples where the Poisson 
distribution has been found to be a good mathematical model are (i) the 
number of calls arriving at a telephone exchange in a given interval of 
time, (ii) the number of cars passing a given point of a busy road in a 
given interval of time, (iii) the number of blood cells in a given area say 
1 cm? of a haemocytometer, (iv) bacteria counts, (v) stars in space, and 
in many more similar cases. 


15.9 THE NORMAL OR GAUSSIAN DISTRIBUTION 


The binomial and Poisson distributions are both examples of theoretical 
distributions of a discrete random variable. The normal distribution is 
a continuous distribution whose characteristics are often found in 
practice and hence it serves as a mathematical model for these distribu- 
tions. It has the frequency function defined by 


1 
fis) = aes xp |- 5: ΕΣ (47) 
where yz and o? are constants, 

The characteristics of the graphs of y = f(x) defined by equation (47) 
are that all are bell-shaped and symmetrical and die out quickly at the 
tails. A typical normal distribution is shown in Fig. 15.11. 

It can be shown that 


[ f@) dx = 1 for all values of « and σ (48) 
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Fig. 15.11 


15.9.1 Mean and variance of the normal distribution 
It can be shown that 


mean = a x f(x) dx = p, (49) 


and variance = [ (x — p)? f(x) dx = σϑ. (50) 


The integrations (48), (49), and (50) are beyond the scope of this 
book. 


15.9.2 The standardized variate 

Usually in problems the values of μ᾿ and o are known or can be found. 
Probabilities, corresponding to the areas under the curve y = f(x) 
between given limits are required, and these are tabulated or, in some 
cases calculated, for μ = 0, o? = 1. If the random variable ¥ has a 
normal distribution with μ = 0 and o? = 1, then YX is said to have a 
standardized normal distribution. If x has a normal distribution with 
mean y and variance σ΄, this will be written x = N(u, 0”). Changing 
the origin and scale so that the new mean is zero and the new variance 1, 
ie. writing y = x — p gives 


mean of y = mean of x —-n=yu—pyw=0, 
and writing X = y/o = (x — p)/o gives 
var (X) = var (y)/o? = var (x)/o? = o?/o? = 1, 
i.e. X = (x — p)/o = NO, 1). (51) 


The graph of y = f(X) = [exp (—4X°)]/+/(2z) is sketched in Fig. 
15.12. It is an even function since f(—X) = f(X) and hence is sym- 
metrical about the y-axis. 
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The distribution function F(X) is defined by 


x 
F(X) = [40 dt = τῖς ie exp (—4t”) dt (52) 


(see equation (23) of section 15.3.2). This gives the area under the curve 
y =f(X) from —o to X and represents the probability that X is less 
than some fixed value. 

Also, from equation (22) of section 15.3.2, 


[pepax = P< x< δ). 


Fig. 15.12 


It is possible to evaluate the integral on the left between finite limits, 
using the numerical methods of Chapter 14. In particular, using 
Simpson’s rule of section 14.8.3 with ἢ = 1, 


4 
πὶ [ exp (--ξ ΧὉ) dX = 0.497 5 =0°5. (53) 
0 


It is not necessary to evaluate the integral each time it is required, since 
tables of the distribution function F(X) are published, for example, in 
Cambridge Elementary Statistical Tables by D. V. Lindley and J. C. P. 
Miller. (It is assumed that the reader has a copy of these, or similar, 
tables.) The function F(X) defined by equation (52) is tabulated for 
values of Y from X¥ = 0 to X¥ = 4-0. From the tables, when X = 0, the 
value of F(X) is 0-5, and this is understandable, since it represents the 
area from the left tail to the centre, or half the area under the curve in 
Fig. 15.12. Compare this with the result (53) which gives the area under 
the curve from X = 0 to ¥ = 4, which is approximately half the total 
area. 
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As X increases towards the value 4, F(X) increases towards the value 
1, or the total area under the curve. When YX = 0-67, then 
F(X) = 0'748 60-75 using tables. Hence, between ¥ - Ὁ and 
X = 0-67 there is approximately (0-75 — 0:5) = 0-25 of the total area; 
or, because of symmetry, approximately half the area lies between 
X = +0-67. The area under the curve between Y = 1.96 is 
2(0:975 — 0:5) = 0-95, or about 95 per cent of the total area lies 
between X = 41:96. 


Example Let x be M(1, 4). Find the probability that x is less than 3. 


Solution Given x = N(1, 4), write ¥ = (x — 1)/+/4 then X¥ = N(Q, 1) 
and 
P{x < 3} = P{X < 18 -- 1)} = P{X¥ < 1} = 0-841 


using tables to find the final answer. 


15.9.3 The normal distribution as an approximation to the binomial 
distribution 

When n is large and p is small the Poisson distribution gives a good 

approximation to the binomial distribution. When Μ is large and p is 

not necessarily small, the normal distribution is a good approximation, 

To show the degree of approximation consider the following two 

numerical examples. 


Example (i) A given discrete variable has a distribution with mean 12 
and variance 4, What is the probability that it is 14 or more? 


Solution Here np = 12, npq = 4 so that 4 = 4, p = 3, n= 18, and 
the binomial distribution is 


= (760) 


The values for r = 0, 1, 2, 3,. . ., 18 are given in Table 15.6, and the 
histogram of these values is drawn in Fig. 15.13. The computation of 
values of f is made slightly easier by computing each term after the 
first from the previous term, so that 


SO) = (ἢ = bo, 
f() = 18 x 24 = ἢ, 


18.17.16... .(18 —r +41) /2\"/1\28-" 
ands ΟΞ i ee eer (5) (G) 


a ACE 


Da 
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This method was used to find the values of fin Table 15.6, the values 
then written correct to four places of decimals. 


r 4 5 6 7 8 9 10 1 
f@ | 09-0001 0-0007 0-0031 0-0105 0:0289 0:0643 0-1157 0168 2 
r 12 13 14 15 16 17 18 
f(r) | 0-1963 0:1812 0:1294 0:0690 0:0259 0:0061 0-0007 

Table 15.6 


The required probability is given by 
18 
Pir >19= > fv) =0-2311, 
r=14 


and this is represented by the area of the histogram in Fig. 15.13 to the 
right of AB. 


13%2 14V2 
Fig. 15.13 


The normal approximation to the area of the rectangle ABCD is the 
area under the corresponding normal curve, or EBCF, i.e. the area 
under the normal curve from 13} to 143. To evaluate the required 
probability using the normal approximation to this binomial distribu- 
tion it is necessary to evaluate the area to the right of 134. The continuity 
correction of 4 must always be allowed for. In this example, first 
standardizing the variate and then using tables, 


% = 412 


/4 a(x = 12), 


X= 
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and the required probability, 
P{x > 14} = P{X > 41134 — 12)} 
= P{X > 0-75} 
= 1 -- Ο:773 4 
= 0:226 6. 
The approximation is good for such a small value of n. 


Example (ii) A random sample of 100 screws is taken from a batch 
which is known to be 3 per cent defective. Calculate the probability that 
the sample will contain (a) more than 5 defectives, (b) between 2 and 5 
defectives. 


Solution Here n = 100, p = 0-03, 4 = 0-97, so that μ = np = 3 and 
σῇ = npg = 2:91. 


Let x be the number of defectives per sample of 100, then x is N(3, 2:91). 
Let X be the standardized variate, so that 

X = (x — 3)//(2:91) = (x — 3)/1-706, 
and hence X is N(0, 1). Then, 


(a) the probability that a sample contains more than 5 defectives is 
given by 


P{x > 5} = P{X > (44 — 3)/1-706} 
= P{X > 0:879 4} 
= 1—P{X < 0879 4} 
= 0-189 6; 


(b) the probability that a sample contains between 2 and 5 defectives 
is given by 
P{2 <x < 5} = P{l'5 — 3)/1-706 < X < (5:5 — 3)/1-706} 
= P{—0:879 4 < X < 1.466) 
= 0-928 7 — (1 — 0:810 4) 
= 0-739 1. 

It can be proved that if x represents the number of successes in n 
independent trials of an event where p is the probability of success in 
a single trial, then x —> N(np, npq) as n increases. 

It is sometimes more convenient to work with the proportion of 


successes, x/n in n trials where p is the probability of success in a single 
trial. Then 


x/n - N(p, pq/n) as n increases. 
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15.9.4 Summary 


In any given problem (i) if 7 is small the binomial distribution, (37) 
of section 15.7 is used, (ii) if m is large and p is small the Poisson 
distribution, (44) of section 15.8 is used, and (iii) if πὶ is large and p 
is either small or large the normal approximation may be used. 
The normal distribution is the easiest to use since this is tabulated for 
the standardized variate. The tables used in this chapter tabulate the 
function 


ι FR och 
Vn) Ϊ Ἐν θα 


which is the probability that the standardized variate is less than X. 
Other tables give values of 


LD. ἡ ~307 J 
coal Bar ae 


which is the probability that the standardized variate lies between 0 
and X. 


15.10 SOME USEFUL THEOREMS 


Let x and y be two independent random variables, i.e. the probability 
that x takes any set of values is unaffected by y and the probability 
that y takes a set of values is unaffected by x. Then, 


(i) if each value of x is multiplied by a constant c the mean of the 
resulting variable is c times the mean of x, i.e. 


(cx) = cx, 
(ii) the mean of the sum or difference of the variates x and y is the 
sum or difference of their means, i.e., 
A YH ΧΡ, 


(iii) the variance of the sum or difference of the variates is the sum of 
their variances, 1.6. 


o%(x + y) = o%(x) + 0°(y), 
(iv) if each of x and y has a normal distribution then (x + y) has a 
normal distribution. 


The proofs of these theorems will not be given here and may be found 
in textbooks of statistics. 


EXERCISE 15 
1 A box contains 4 white and 2 red marbles. One marble is drawn 
from the box. What is the probability that the marble selected. is 
(i) white, (ii) red? 
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2 What is the probability of obtaining the score (i) 6, (ii) 4 on rolling 
one die? What is the probability of obtaining a total score (iii) 12, 
(iv) 7 on rolling 2 dice? 


3 Find the probability that in 4 hands dealt from an ordinary pack 
of 52 playing cards each hand contains one ace. 


4 Find the probability that at least one head is thrown when 10 coins 
are tossed. 


5 A single die is rolled until a six is obtained. What is the probability 
that a six will be obtained (i) on the third attempt, (ii) on the mth 
attempt? 


6 Two different digits are chosen at random from the set 1, 2,3,. . .,9. 
Find the probability that (i) the sum of the digits is exactly 6, (ii) 
the sum of the digits is greater than 14, (iii) both digits are greater 
than 6. 


7 The probability that one particular component of a machine needs 
replacing in a 5-year period is } and that of another is $. Assuming 
that these two events are independent, find the probability (i) that 
either one or the other needs replacing, (ii) that both components 
need replacing, (iii) that neither component needs replacing, in the 
period. 


8 A and B take turns in throwing a pair of dice. A starts and the first 
to throw a total score of 9 wins. What are the respective probabili- 
ties? 

9 A single die is rolled. What is the probability that the score is 3 if 
it is known that the score is odd? 


10 A box contains 3 black and 2 red balls marked 81, B2, B3 and Rl, 
ΚΖ. Two balls are drawn in turn from the box without replacement. 
Find the probability that (i) both balls are red, (ii) both balls are 
of the same colour, (iii) a total score of 4 is obtained, (iv) the total 
score is 4 if it is known that both balls are black, (v) the first ball 
drawn was black if it is known that the second one drawn is 8]. 

11 A random variable x has the frequency function f(x) = οχϑ 7, 
x 20. Determine (i) the value of c, (ii) the distribution function 
F(x), (iil) P{x < 1}, (iv) P{l < x < 2}, (v) the mean and variance 
of x. 

12 A random variable has the frequency function f(x) defined by 
fx) =0, x<0; fxh=cx, 0<x<1l; fWMect+I1—--x, 
1<x <2; f(x) = 0, x > 2. (i) Evaluate c, (ii) find the distribution 
function F(x) and sketch the graphs of f(x) and F(x). 

13 Use coding to find the mean and standard deviation of the frequency 
distribution given in the following table. 

x | 2:30 2:35 2.40 2:45 2:50 2:55 260 2:65 2:70 2:75 
tf 1 5 6 13 8 17 14 7 1 3 
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14 Examination marks are scaled either (i) by subtracting 10 from 
each mark or (ii) by subtracting 10 per cent of each mark. How do 
these forms of scaling affect the mean and standard deviation of 
the set of examination marks? 


15 Sketch the histograms of the binomial distributions in which p = ὁ 
and (i) n = 5, (ii) n = 10, (ili) n = 25. 

16 Experience shows that 25 per cent of a new seed germinates. If 
1 024 packets each containing 5 seeds are tested, find the number of 
packets in which it is expected that 0, 1, 2,. . ., 5 seeds germinate. 
Sketch the histogram. 


17 A firm makes a certain kind of switch and produces 0:5 per cent 
defective. They are packed in boxes of 100. What percentage of 
boxes (i) has no defective switches, (ii) has 2 or more defective 
switches? Use (a) the binomial and (b) the Poisson distributions. 


18 Fit a Poisson distribution with the same mean to the following data: 
Number of faults 0 I 2 3. 4 5 
Frequency of faults 18 28 25 17. 9 3. 


19 The number of telephone calls received by an operator between 
8.30 and 8.35 a.m. fits a Poisson distribution with mean 3. Find the 
probability that (i) in any given day there will be no calls in that 
interval of time, (ii) in 3 given consecutive days there will be a total 
of 2 calls in that interval of time. 


20 A manufacturer knows that 15 per cent of his product is defective. 
It is inspected in batches of 60 units. Find the mean and variance 
of the number of defectives in batches of 60 and use tables of the 
normal distribution to calculate the approximate percentage of such 
batches which contain more than 10 defectives. 


21 On the same axes sketch the normal curves of f against x for one 
hundred trials of an experiment when x is (i) N(O, 1), Gi) NO, 4), 
(iii) N(O, 100). What percentage of the total area in each case lies 
between +0°670, +o, +1:960? 

22 The resistances of 200 similar resistors of nominal value 1 000 ὦ 
were measured to the nearest ohm and their values classified in the 
following table. 


995 996 997 998 999 1000 1001 1002 
5 0 13 12 12 22 13 17 


Resistance x 
Frequency f 


Resistance x | 1003 1004 1005 1006 1007 1008 1009 
Frequency f 19 20 16 18 15 12 6 

Find the mean resistance and the standard deviation. Assuming a 
normal distribution with this mean and standard deviation find the 
probability that any one resistor falls outside the manufacturer’s 
guarantee that the values of the resistances are within +1 per cent 
of their nominal value. 


16 
Statistics Il 


16.0 INTRODUCTION 


Broadly speaking, the aim of most statistical enquiries is to infer 
properties of a population from a sample from that population. The 
population corresponds to the frequency function, the sample corre- 
sponds to an observed frequency distribution. If the number of observa- 
tions is large, relative frequencies of the observed frequency distribution 
tend to probabilities of the frequency function. 

Thus, given an observed frequency distribution, the problem is to 
select a theoretical (mathematical) model, to check that this model is 
reasonable, and finally to draw conclusions about the population. 

There are three types of statistical problem to consider. The first is 
the estimation of parameters from observed frequency distributions, 
for example, the estimation of the value of the mean or variance of a 
population from the data from a sample. The second is the testing of 
hypotheses, which leads to significance testing. Finally, one test that 
the model chosen is a reasonable model will be given. 


16.1 METHODS OF SAMPLING 


A random sample is a sample from a population in which each individual 
member of the population has the same chance of being selected. 
Simple sampling is random sampling where the probability of selecting 
any individual remains constant throughout the sampling. If the popu- 
lation is very large or can be thought of as infinite, then every random 
sample is simple. If the population is finite, simple samples are obtained 
only if sampling is with replacement, i.e. after the selection of an 
individual it 15 replaced in the population. From now on in this chapter 
all samples are simple. 


16.1.1 Estimation 


In most cases a good estimate of an unknown population parameter is 
obtained by computing the corresponding quantity for the observed 
frequency distribution, so that, to estimate the population mean yu the 
mean X (or m) of the observed distribution is used, to estimate the 
population variance o” the variance 52 of the observed distribution is 
used. That is, 


estimate μ᾿ by X = (2n,x,)/n, 
estimate o? by 52 = [in,(x, — X)?]/n, 
where xn, = n = total number of observations. 
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16.1.2 Definition 


Any quantity computed from the sample and used to estimate an 
unknown population parameter is called a statistic or estimator. 


16.2 SAMPLING DISTRIBUTION OF THE MEAN 


Suppose a sample of n observations x1, X2, X3,. . ., X, ἰδ taken from a 
population, then the sample mean is, say, X, = (&x,)/n. Suppose a 
second sample is taken from the same population, then this will have 
a sample mean x, which may be different from X,. A third sample 
under the same conditions will have mean X3, and so on. That is, the 
value of x varies with the sample, and the sample means may be 
tabulated as in Table 16.1. 


Sample number 1 2 3 sete κ 
Sample mean Xy Xe X3 . Xe 
Table 16.1 


Thus X is not constant over all samples but has a distribution, and the 
purpose is to find the frequency function for x. This is called the 
sampling distribution of the mean. 

Suppose the population has mean u and variance o?, and a random 
sample x1, Χο, Xs, . - -, X, is taken from this population. The sample 
mean αὶ = (2x;,)/n. Since the sample is random, xj, ΧΩ, X3,. . .) Χῃ Can 
be interpreted as n independent random variables. Also the mean of 
x; = wand the variance of x, = o? for all x,, since each is drawn from 
the same population. Then, 


(mean of x) = (mean of x, + mean of x, +... + mean of x,)/n 
=(utut...+yp)n 
= nyln = μ 
or, mean of x = population mean y; 
(variance of %) = var (χ + x2 +%3 +... +%,)/n? 
== [var (x1) + var (x2) + var (xg) +. . . + var (x,)]/n? 
SINCE X1, X2, Xz, . . .» X_ are independent. 
Hence var (X) = (0? ++ σῇ +o? +... +4 .67)/n? 
= no?/n? 
= o7/n. 
Hence mean (X) = yw, var (X) = o7/n. (1) 


If the population has a normal distribution, nX = x, + x2 + ΧΆ 
+...+ x, is also normal, i.e. 


nx = N(np, no?), 
or X = N(u, o/n), (2) 
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and the sampling distribution of the means is also normal. The standard 
deviation, which is the square root of the variance, of a sampling 
distribution is called the standard error, (S.E.), so that S.E. (%) = o/4/(n). 
The distributions of x = Νίμ, 1) and of ¥ = N(y, 3) are sketched in 
Fig. 16.1. 


Fig. 16.1 Normal distributions for x, σ = 1 and X, σ = } 


If two independent samples from two different populations are being 
considered, then 


S.E. ( + J) = s/{var (ἃ + ¥)} 


= s/{var (x) + var (j)} 


where σῇ is the variance of the population of x’s and n, the sample size, 
o2 the variance of the population of y’s and 2 the sample size. 


16.2.1 Population parameters estimated from a sample 


It is necessary to estimate the population parameters μ and o* from a 
sample from the population. First consider an estimate of the popula- 
tion mean μὶ to be the sample mean X. From equation (1) of section 16.2 
the mean of X is μ and the variance of X is o?/n, so that the distribution 
of sample means is ‘narrower’ than the distribution of the population. 
This is shown in Fig. 16.1 with o= 1, n = 4. Hence X has a high 
probability of being near to μ. Moreover as the sample size increases, 
% approaches the value μ. Hence the mean of the population is estimated 
as the mean of a sample. That is 


B=X (4) 
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The sample variance 55 can be shown to be the fraction (1 -- 1)/n 
of the population variance, that is, it is too small. Hence for small 
samples the variance ns?/(n — 1) is used to estimate o”, and since 


ns? 1. 
τος το τθος aie 

n-1 nai 2, x); 
the estimate of o? is evaluated by dividing the sums of squares by 
(n — 1) instead of n. For large values of n the factor n/(n — 1) is not 
important. Hence the variance of the population o? is estimated from 
the sample as 

n 1 


2 2 
or = sc 
n—-1 n—-1 


2 (x; — Χ)3, ἢ small, (5) 
σϑ = s?,_n large. (6) 


16.3 TESTING HYPOTHESES AND CONFIDENCE 
INTERVALS 


A fairly general definition of a statistical hypothesis is that it is an 
assumption about the frequency function of a random variable. A test 
of a statistical hypothesis is a test to decide whether to accept or reject 
the hypothesis. In problems considered in this section, the object is to 
test an assumed statement, called a null hypothesis (N.H.) to see if it 
is acceptable. 


16.3.1 Significance test based on the normal distribution 


This test depends on knowing that the population being tested is 
normally distributed, on knowing the variance, and on having a reputed 
mean of the distribution. 

Suppose a sample of individuals from a given population has mean 
X, the general procedure for testing this mean is as follows: 


(a) make the null hypothesis that the true mean is p, 

(b) note the difference between the sample mean X and y, 

(c) find the theoretical probability that in random sampling the differ- 
ence between X and μι is greater than the observed difference, 

(d) if this probability is sufficiently small, reject the null hypothesis. 
That is, conclude from the given data that this sample is not from the 
given population whose mean is μ. 


An example to demonstrate the procedure follows. 


Example A firm is producing packets whose weights are known to be 
normally distributed with variance 0:25 gramme’, and the mean weight 
is supposed to be 10 g. A random sample of 10 packets is taken and 
the sample mean is found to be 9-64 g. Is the firm giving correct weight ? 
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Solution Let the weight in grammes be x. The sample mean < is 9-64, 
the number of items in the sample x is 10. Following the procedure for 
testing this value of x: 


(a) the N.H. is that the population mean μ is 10; 

(Ὁ) the sample mean X differs from yu by 0:36; 

(c) the probability that in random sampling ¥ would be such that 
|X — μ| > 0-36, i.e. the probability that |x — u| > 0-36 by chance is 
found by standardizing the variate and using tables of the normal 
distribution as follows. Let 


X= (ἃ -- μ)(σ! νὴ) 
—0:36 +/10/0°5 
= —2:28, 
then X = N(0, 1), and from tables, 
P{|X| 5 2:28} = 2P{LX 3 2-28} 

= 2[1 — P{X < 2:28}] 
= 2[1 — 0-988 7] = 0-022 6 
<0-05 or 5 per cent. 


(d) The critical probability level is usually taken at 5 per cent, and the 
test shows that x differs significantly from μ at the 5 per cent level, and 
the null hypothesis is rejected. Hence it is not by chance that the 
incorrect weight is being given. 


Values of the standardized variate X and the corresponding proba- 
bilities are given in Table 16.2. 


P 20% 10% 5% 2% 1% 015 
[|X| > 1 1-282 1-645 1960 2:326 2:576 3-291 
Table 16.2 


The general procedure for testing means in normal populations is 
now re-written as follows: 


(i) make the null hypothesis that the true mean is μέ, 

(ii) compute X = (ὦ — )/(a/+/n), where o? is the variance and n the 
number of observations in the sample, then X¥ = N(0, 1), 

(iii) if |X| > 1-96, then X differs significantly from μ at the 5 per cent 
level and the N.H. is usually rejected, 

(iv) if |X| > 2-576, X differs significantly from μ at the 1 per cent level 
and the N.H. is always rejected, 

(v) if |X| < 1:96 there is no reason to reject the N.H. 


It is not possible to prove that the null hypothesis is true. If |X| < 1-96 
the sample values are consistent with the null hypothesis, or there is no 
evidence from the given sample to reject the null hypothesis. 
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The above test is a two-tailed test in which the probability that |X| 
is greater than a fixed number is given. A one-sided test in the above 
example would be say, a test for short weight and 

P{X < —2:28} = P{X > 2-28} 
= 1 — 0-988 7 
= 0-011 3 
<5 per cent level. 


16.3.2 Confidence intervals (interval estimation) 


Assume that the population mean yw is unknown and is estimated by 
the sample mean ¥. Then the 95 per cent confidence interval for μ is that 
interval such that 


Ρίμ — 1-960/s/n < %¥ < pw + 1:960/4/n} = 0-95 

ie. P{E — 1:960/4/n < p< ¥ + 1:960/+/n} = 0-95 
since if X is within + 1-960/4/n of y, then μι is within + 1-960/4/n of X. 

In the above example ¥ = 9-64 and the interval becomes 

9-64 — 1:96 .0-5/4/10, 9-64 + 1:96 . 0-5/4/10 


which is 9-33 to 9-95. 

A confidence interval for any parameter may be thought of as the 
values between which that parameter must lie for a test of significance 
to be non-significant. In terms of the mean, with standard error o/4/n, 
as n increases, o/./n decreases so that the width of the confidence 
interval decreases and accuracy increases. 


Example (i) The values of resistors produced by a given machine appear 
to have a normal distribution with mean 29 Q and variance 1-44 Q2. 
The process is modified in such a way that the variance is not affected 
but the mean may be. A random sample of 10 resistors is taken and 
the values in Q are 


30°31 31-14 30:14 30:22 27:87 
28°37 31-37 29-70 29-07 29-41 


Has the mean changed? 


Solution The N.H. is that there is no difference in the mean, or that 
the mean yp is 29. The sample mean < is 


29 + (1:31 + 2:14 4... +0-41)/10, 


or X is 29-76. 
The standardized variate X is given by 


129-76 — 29|  0-764/10 
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This is greater than 1-96 and hence the ΝΗ. is rejected. That is, x 
differs significantly from μ at the 5 per cent level and modification has 
tended to increase the mean. 

It is of interest to note that the 95 per cent confidence limits for the 
new mean are ἃ — 1:960/4/n, Χ + 1:960/4/n which are 29-02 and 
30-50. The interval between these limits does not contain the value 29, 
of the old mean, and this was to be expected from the result found. 


Example (ii) The classic blindfold drinking test is based on the binomial 
distribution and the technique of testing is the same. A person claims 
that he is able to distinguish between two drinks A and B. In an experi- 
ment he is correct 5 times in 6 trials. Does this result justify his claim? 


Solution The N.H. is that the person guesses, i.e. that he is equally 
likely to choose A or B, i.e. that the number of correct guesses has a 
binomial distribution with p = 4, q = 4. Let r be the number of 
successes, then from the binomial distribution, with n = 6, 


6\ /1\"/1\*" 
r= ()0} 6) 
ΒΤ 
~ Ar) 12) 
The mean number of successes is given by mp which is 3, and the 


observed number of successes is 5. Hence the difference is 2. The 
probability that this difference is greater than or equal to 2 is given by 


P{|r — πρὶ > 2} = ΚΟ) + ΧΙ) +f) + FO 
= @°1+6+6+13 
= 14/64 
= 0-22. 


This probability is greater than 5 per cent, and hence there is no reason 
to disbelieve the null hypothesis that the person guesses. 


16.4 LARGE SAMPLE THEORY 


If it is known that a population is normally distributed, then the 
distribution of the means of samples from the population is also 
normally distributed. If the population distribution is N(u, 07), then 
the sampling distribution of the mean is N(u, o7/n). 

If the population is not normal, it is still true that the mean of the 
distribution of the means x is u and its variance is o?/n, but X is not 
normally distributed. It can be shown that the distribution of the means 
& tends to normality as ἢ increases. This is known as the Central limit 
theorem which may be stated as follows. 
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Let x1, X2, Xa, . . -» X, be m independent random variables from the 
same distribution (not necessarily normal), with mean (x,) = p, 


variance (x,;) = o, for all values of i. If ἃ = (3x) / n, then the 


t=1 
theorem states that, as ἢ -- οὐ, (ὦ — u)/(a/4/n) > NO, 1). 
The distribution of most estimators computed from a sample of size 
n approach normality as n increases. 


16.4.1 General procedure for large sample testing 
A large sample is one which contains more than about fifty observa- 
tions, and the procedure for testing a parameter 6 is as follows: 
(i) make the null hypothesis that the value of the parameter is 0, 
(ii) compute an estimate of # from a sample, say 4, 
(iii) evaluate the standard error of 4%, say so, 
(iv) compute the standardized variate X given by 
|X| = | — 4|/s0. 
Then, 
if |X| > 2, corresponding approximately to 5 per cent significance, the 
N.H. is usually rejected, 
if |X| > 3, corresponding approximately to 1 per cent significance, the 


N.H. is always rejected, 
if |X| < 2, there is no evidence from this sample to reject the N.H. 


16.4.2 Testing the mean of a large sample 


Let a sample of n (large) observations have values x1, X2, X3,- - +5 Xn 
and let the problem be to test whether the population from which the 
sample is drawn has mean μ. The population is not necessarily normal, 
and the variance is unknown. Using the procedure of section 16.4.1, 6 
now being the mean, 


(i) the ΝΗ. is that the population mean is yp, 
(ii) the sample mean X is (2x,)/n, 
(tii) the standard error of the mean is o/4/n and o is unknown. Hence 
o* is estimated from the sample by s? the sample variance, where 
55 = [U(x — ¥)?]/n 
= [Σχξ — ¥7]/n. 


(iv) The standardized variate X given by |X| = |X — u|/(s/+/n) is now 
tested. 


Example A sample of 900 observations is found to have a mean 3-5 
and standard deviation 2-62. Can it be regarded as a random sample 
from a population whose mean is 3-34? 
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Solution The N.H. is that the mean uw = 3-34, The mean of the sample 
X = 3-5 and the estimated standard deviation s = 2-62. Hence the 
standardized variate X is given by 


ya 253 
[ΧΊ = 563770900) 
. 48 
= 2.62 


and the result is obviously less than 2. Hence the result is not significant 
and there is no reason to suppose that this sample was not drawn from 
the population whose mean is 3-34. 

It is of interest to note that in this example, the 95 per cent confidence 
limits for the mean are ζ — [965 νη, X + 1-96s/4/n, which equal 
3-33, 3-67 (or αὶ — 2s/4/n, Χ + 2s/4/n, which equal 3-33, 3-67), and the 
interval between these limits includes the mean of the population. 


16.4.3 Testing the difference between two means 
Two problems are considered: 


(a) given two independent samples of n, and nz observations, they are 
tested as to whether they come from the same population, 


(b) given two samples from different populations, to test whether the 
means of the populations are the same. 


The procedure is as follows: 


(a) the N.H. is that both samples are drawn from the same population 
whose mean is w and whose variance is o*. 


Let X, be the mean of the sample size πᾳ and X, be the mean of the 
sample size "4. Then var (%,) = 0?/m, var (Χ2) = o?/n2 and using 
section 15.10 theorem (iii), 


var (X, — X2) = o?/n, + 07/n2 


td 
so that εξ if ( ss 1) 


ny Ne 


If o? is unknown a pooled estimate s? is used where 


ee 
nm + Μὰ 


[2@, — ¥)? + ΣΟ — ¥2)?] 


af nys? + ngs? 
Mm + nz 
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Also, since the mean of (X,; — ΧΩ) is zero, using section 15.10 theorem 
(ii), the standardized variate X is given by 


Ἢ \(%1 -- X2) — Ο] 


od lic tal 


(b) The N.H. is that the means of the two populations are each equal 
to uw. 


Xx 


and X may be tested as usual. 


Let the sample means be ¥, and ΧΩ of samples size m, and nz respectively 
from populations whose variances are σὲ and o%. Then, 


var (ἃ: — Χ9) = var (X,) + var (x2) 


2 2 
τισι θὲ 


ny Ne 


2 2 
and S.E. (αι -- %) = /@ a <2), 


ny “No 
mean (X,; — ζ2) Ξε ἵὶ -- μ᾿ Ξξῦ, 


and finally 


which may be tested as usual. 


If of and of are unknown, they may be estimated from the samples by 
1 
s} = 204 -- χρὴ 
ny 


1 
and i= 7 2 — X2)?. 


These tests are exact if (i) the populations are known to be normal 
and (ii) the variances are known. If (i) holds and (ii) does not, it is 
better to use the ¢-test which is an exact test valid for samples of any 
size and is discussed in section 16.6.1. 


Example (i) The means of two random samples of 100 and 200 observa- 
tions are found to be 66-5 and 67-1 respectively. Can these samples be 
regarded as drawn from the same population whose variance is 6-25? 
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Solution The N.H. is that both samples are drawn from the same 
population, or, that there is no difference in their means. Then, mean 
(X, — X_) = 0, o? = 6-25, so that o = 2°5, and 


1 1 
__ 0-75 
“ Ve 


_ [δ — ¥2) -- ΟἹ 
7 S.E. 


_ 066 
~ 075 
<2 


Finally, Xx 


and there is no reason to reject the null hypothesis, and both samples 
could be regarded as having been drawn from the same population. 


Example (ii) A sample of 7 000 observations has a mean 46-9 with 
standard deviation 1-95 and a sample of 2 000 observations has a mean 
45:9 with standard deviation 2-02. Is the difference in their means 
significant ? 


Solution The N.H. is that the samples are drawn from different popula- 
tions with equal means, or, there is no difference in the two means. 
Estimate σῇ by s? = (1-95)? and σῇ by 58 = (2-02), then 


1953 2:02? 


= 0-050 8, 


x mas Xa| = 1, and 


1 
X= 00508 > 19. 
Hence the difference in the means is highly significant. The null hypo- 
thesis is therefore rejected. There is a difference in means. 


16.5 SAMPLING ATTRIBUTES 


Sometimes it is required to find the probability that an individual has 
a certain characteristic. For example it may be required to find the 
probability that an individual is a cigarette smoker, given the proportion 
of men in that age group who smoke; or the probability that a manu- 
factured part is defective knowing the proportion of defective parts 
produced by the machine making the parts. 
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The general method is to consider a sample of size n, and suppose 
that r out of n possess the attribute under consideration. Then r has a 
binomial distribution and 

‘ n 
P{r out of n have the attribute} = (") pq". 
The mean value of r is np, and to test the hypothesis that the probability 
of an individual having the attribute is p, r is compared to np, that is, 


the theoretical probability that |r — np| is greater than the observed 
difference is evaluated. If ro is the observed value of r, then 


P{|r — πρὶ > |r — πρὶ} is computed. 


16.5.1 Large sample test of a value of p 


For a large sample the normal distribution may be taken as an approxi- 
mation to the binomial distribution, and r is approximately N(np, npq). 
Then the procedure is as follows: 


The N.H. is that the probability any individual has the attribute is p, 
then the mean (r) is mp and the variance (r) is npg. Hence the 5.Ε, 
(r) = «/(upq), and the standardized variate X given by 


_ t= 
Vv (pq) 


is tested in the usual way. 


Example (i) A six-sided die is thrown 9 000 times and a 5 or a 6 is 
obtained 3 210 times. Is the die biased? 


Solution The N.H. is that the die is unbiased, i.e. 
p=PSor6=3, q=1—p=h. 
The expected number of (5 or 6) is np, i.e. 3 000. Then 


|r — np| = 210, 
and S.E. (r) = V(mpq) 
= 4/2000 
= 44-72, 
so that x= an 
= 4-7, 


This result is highly significant and the die is biased. 
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Alternatively, instead of working with r, it may be easier to work 
with the proportion 7 = r/n, then as 


r—> Np, npq), 
id N(p, pain), 
S.E. (7) = /(pq/n) 
ΒΕ ἤτον 
X= pain) 
—> NO, 1) 


and 


which may be tested as before. 


Example (ii) Using the data of the previous example and the proportion 
ar, find whether the die is biased. 


Solution Here 


2 
n= SO = 03567 
On the N.H. that p = 4 = 0-333 3, 
Ιπ — p| = 0-023 4 
and S.E. (7) = /(pq/n) 
= 0-004 97 
Hence Xi paces 
= 4-7, 


as before. 


16.5.2 Comparison of two large samples 

Suppose that there are two samples of m, and nz observations where 
n, and nz are large, and that in the first sample r, out of m, have the 
attribute A, and in the second sample γα out of nz have this attribute. 
Then it is possible to test whether these two samples come from the 
same population, or whether the probability, p, that an individual has 
the attribute A is the same for the two populations from which the 
samples are drawn. For, on the null hypothesis that p is the same for 
both populations, let πὶ = r,/nm, and 72 = r2/ng, then approximately, 


πι = N(p, pq/m), 
πὸ = N(p, pq/ne). 
Hence, 


1 1 
a cs Ν (0.04 (5, +5) 
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and the observed difference 7, — 72 is compared with the theoretical 
difference zero, using 


ye ee mal 

S.E. (7, — πα) 

If p, 4 are unknown, a pooled estimate ἢ of p is used, where 
.. Tu hs 
P Ny + Ne 
__ nym + Net 
ny + Ne 
and ἢ Ξ5 1 -- Ρ. 


Example Ina test involving the safe driving of nineteen-year old motor 
cycle owners driving approximately the same mileage, it was found that 
in one large city 600 out of a sample of 800 had no accident in one 
given year, while in another city in the same year, 700 out of a sample 
of 1 200 drivers had no accident. Do these results show that one city 
has a better safety record than the other for this class of driver? 


Solution The N.H. is that the proportion of safe drivers is the same for 
both cities. 
The relevant calculation is set out below: 
ny = 800, y= 600, πι -Ξῷ ὃ, 
ny = 1 200, n= 700, TT, = 7/12, 
B = (600 + 700)/(800 + 1 200) = 13/20, 


ᾷ = 7/20, 
πὶ — πὰ = 1/6, 
S.E. (7, — 72) = V/(13 x 7)/(20 x 20 x 480) = 0-021 77 
0-166 7 
and x= 0-021 77 > 7. 


The result is highly significant, and the null hypothesis is rejected. That 
is the proportion of safe drivers in the two cities is different. 


16.6 SMALL SAMPLE THEORY 

Tests of significance for a large sample are constructed by comparing 
the modulus of the difference between observed and expected values of 
a statistic with the standard error, since the distribution of most 
statistics tends to normality as the number of individuals in the sample 
increases. For small samples most distributions of statistics are far from 
normal, in particular estimates of parameters obtained from small 
samples are not reliable. For example, substituting the sample variance 
s? for the unknown population variance o? affects the distribution 
seriously. It is necessary to construct exact tests which are valid for 
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small samples (i.e. samples of between 5 and 50 individuals). A small 
sample test which is exact and which is valid for samples of all sizes is 
Student’s t-test, first derived by W. S. Gossett (1908) who wrote under 
the name ‘Student’. 


16.6.1 The t-distribution 


Suppose x,, i = 1, 2, 3,. . ., 2 is a random sample of n observations 
from a population whose mean is « and whose variance is o7. If the 
population is normal then the distribution of % is N(u, o?/n). If o? is 
unknown it is estimated by 


# = {U(x — ¥)7}/@ — 1), (7) 


using equation (5) of section 16.2.1. Then (% — 4)/G/+/n) no longer 
has a normal distribution, although its distribution tends to normality 
as n increases. The exact distribution of (¥ — y)/(S/+/n) is known and 
is called the t-distribution with n — 1 degrees of freedom. The n variables 
in equation (7) are not independent since they are connected by the 
relation X(x, — X) = 0, so that the number of independent variables is 
reduced from ἢ to (n — 1). 

The ¢-distribution is tabulated in Table 3 of Cambridge Elementary 
Statistical Tables, Lindley and Miller, and a short extract from this is 
given in Table 16.3 below. 


‘oni 
Number of degrees Significance levels 


of freedom 5% 1% 
1 12:71 63:66 
5 2:57 403 
10 2:23 3.17 
15 2:13 2:95 
fo) 1:96 2:58 
Table 16.3 


It is usual to write 
tha = (ἃ — Wi] Vn). 
Note that, as ἡ — οὐ, t, > N(0, 1). 


16.6.2 Testing a mean, variance unknown 


To test whether the sample of ” observations x1, X2, X3, .. ., Xq iS 
drawn from a normal population of mean y, the procedure is as follows. 
The N.H. is that the population mean is μ, the sample mean x is 
(2x,)/n, the estimate of the variance §? is given by 


35 = {2 — X77 — ἢ 


= οὐ 
n—1 n 
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and t,-1 = (ἃ — μ))(δ Μη) has a t-distribution with n — 1 degrees of 
freedom. 
Let t, be the « per cent point of the ¢-distribution with n — 1 degrees 
of freedom, i.e. 
ΡΞ ὸ = «/100. 


Then if the modulus of the observed value of ¢ is greater than or equal 
to ἕῳ the mean X differs significantly from yu at « per cent. The value of 
α is usually taken to be 5. 


Example A sample of ten observations from a normal population has 
values 


31-31 32-14 31-14 31-22 28-87 
29-37 32:37 30:70 30-41 30-07. 


Test whether this sample was drawn from a population whose mean 
is 30. 


Solution The N.H. is that the population mean y is 30. From the data, 
Χ = 30-76, n = 10 so that ἡ — 1 which is the number of degrees of 
freedom, is 9. Also the estimate of variance §? is given by 


§ = [Σὰ — X)?]/9 
= [2x? — (2x)?/10]/9 
= 1-251 3. 
Hence 
ὃ = 1-118 6, 
and δ! Μη = 1-118 6/4/10 
= 0-353 7. 
Hence, to = ( — p)/(S//n) 
= (0:76)/0-353 7 
= 2-149. 
From tables the 5 per cent point of fy is 2:26. The observed value of fy 
is 2-149 which is less than 2:26. Hence the result is not significant at 


the 5 per cent level and there is no reason to reject the null hypothesis. 
The « per cent confidence interval for u when yu is unknown is 


X— t,8[4/n, %+ t,8]r/n 


where ¢, is the « per cent point of ¢ with πὶ — 1 degrees of freedom. 
Thus in the above example the 95 per cent confidence interval for y is 
30-76 — 2:26 x 0°353 7, 30:76 + 2:26 x 0-3537 which equal 29-96, 
31-56, 1.6. the mean of the population lies between 29-96 and 31-56 at 
the 95 per cent level of significance. 


STATISTICS II 449 


16.6.3 Testing the difference between two means 
Given two independent samples of m,, and π2 observations, the 1- 
distribution is used to test whether the two samples may be regarded 
as drawn from the same normal population, assumed to be N(u, 0°). 
Let x, and x, be the means of the samples, then 
Δι = Nu, o7/n,), 
Xo ope N(p, o?/Ng) 
and xy peor Xo = N(O, o*(1/n, + 1/ng)). 
To estimate o° it is necessary to use a pooled estimate §? using data 
from both samples, where 
Pa ἴσα: — 1)? + U2 — X2)?] 
(m — 1) + Gz — 1) 
(πὶ — Ist + Ge — 1)s3 β 
" ny + ng — 2 ) 
Then S.E. (ἃ, — Xe) = §4/(1/n, + 1/ne). (9) 


From the null hypothesis that both samples are drawn from the same 
population, 


= |X, =", X.| 
“ΝΕ. (%, — 2) 


has a ¢-distribution with n; + nz — 2 degrees of freedom and can be 
tested as before. 


t 


Example The measurements of a sample of 10 components from one 
machine has mean 60cm and an estimated variance of 49 cm?. A 
sample of 16 components from a different machine has a mean 50 cm 
and an estimate of variance 100 cm*. Test whether both samples are 
drawn from the same population. 


Solution The N.H. is that there is no difference in the means of the 
two samples. Then with n, = 10, %, = 60, 53 = 49, ny = 16, ζς = 50 
and 523 = 100, the pooled estimate of variance is, using equation (8), 
32 = (9 x 49 + 15 x 100)/24 
= 80-875. 
Hence § = 8-994 and, using equation (9), 
S.E. (αι ~ X2) = 8:9944/(1/10 + 1/16) 

= 3-625. 
Hence t = |X, — X|/S.E. (ἃ, — %2) 
10/3-625 

= 2-759 
with 24 degrees of freedom. 
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From tables the value of ¢ for 24 degrees of freedom at the 5 per cent 
level of significance is 206 which is less than the observed value of 
2:759. Hence the result is significant, there is a significant difference in 
the means and the null hypothesis is rejected. 


16.6.4 Paired observations 


It may be possible to arrange a controlled experiment so that two 
samples each of the same number of observations are deliberately 
aired. 

Suppose x, and y,, i= 1, 2, 3, .. ., m are two sets of observations 
which can be paired and that their means are X and j respectively, and 
let Ζ, = x, — y, i= 1, 2,3,. . ., 2. The null hypothesis is that there is 
no difference in the populations from which the samples are taken. 
Hence if * = N(y, o2/n), § = Νίμ, o?/n), then 7 = αὶ -- " = N(0, 0?) 
where o2 is estimated by 52 = [Σ( — Z)*]/(@ — 1). The statistic 
t = |Z — 0|/(s,/+/n) has n — 1 degrees of freedom and may be tested 
as before. 


Example To compare the effect of different fertilizers A and B on the 
growth of tomatoes, a controlled experiment was performed using six 
different positions in a greenhouse and the yield in pounds weight was 
as follows: 


Position 1 2 3 4 5 6 
A(x) 13-2 111 12:2 12:6 10-4 140 
80) 12-9 11-4 10-3 10-8 9-8 13-5 


Test whether one fertilizer is significantly better than the other. 


Solution The N.H. is that there is no significant difference between A 
and B. Here 


z=x—y|03 —-03 19 18 06 055, 
and Xz, = 4:8, 2 = 0:8, &z? = 7-64 and (2z,)? = 23-04, so that 
s2 = {7:64 — 3(23-04)] = 0-76. 
Hence s, = 0-871 8, s,/4/n = 0-355 8 and finally 
t = 2-252. (= 0-8/0-355 8) 


From tables, t = 2.57 for 5 degrees of freedom so that the difference 
is almost significant at the 5 per cent level, and there is insufficient 
evidence from the data to show whether one fertilizer is better than the 
other. 
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16.7 THE CHI-SQUARE DISTRIBUTION 

Let X1, X2, X3,. . ., X, each be normally distributed with mean zero 
and unit variance. Then U = ΧΕ + X¥24+ X24...+ X? has a 
known distribution called the Chi-square distribution, written 7? distri- 
bution, with n degrees of freedom. This distribution which is continuous 
is tabulated and Table 5 of Cambridge Elementary Statistical Tables 
gives the percentage points of the y? distribution. An extract from this 
table is given in section 16.7.3. 


16.7.1 Properties of the ? distribution 

(i) If Ὁ, Uz, Us, ..., Um have independent x? distributions with 
M1, Ng, Nz, . . + Nm degrees of freedom, then U, + U,+ U3 +... 
+ U,, has a x? distribution with n=n, - πὸ Ἔ πο Ἐν... Ἐπ 
degrees of freedom. 

(ii) If the X, are not independent but satisfy k linear equations of the 
form 


aX, + Qy2Xo + QinX3 + oe + QinXy, = 0, i= 1, 2: 3, ery k, 


these k equations give k constraints to the system and it can be shown 
that X?+ X2+ X3+...+ X? now has a y? distribution with 
n — k degrees of freedom. 


16.7.2 Distribution of sample variance 


Let x1, X2, X3, «νον» X, be a random sample of n independent values 
from a normal population of variance o? and let ¥ be the sample mean. 
Then the sample variance s? is given by 


5? = [X(x, — X)?]/n. 

2 _ 2)2 

Hence sale = > σΕΞῸΣ 
σ o 


and since (x, — X)/o is approximately N(0, 1), then ns?/o? has a y? 
distribution with one constraint, for [X(x; — x)]/o = 0. Hence ns?/o? 
has a x? distribution with (ΟἹ — 1) degrees of freedom. 


16.7.3 Short table of ? 


Significance level 


Degrees of freedom 


5% 1% 
2500 | 30-58 
18-31 | 23-21 
11-07 | 15-09 

3-84 6:63 


Table 16.4 
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Notice that if ¥ = N(0, 1), x? = X? and 
P{|X| > 1-96} = 5 per cent, 


or P{X? > 1-967} = 5 per cent 
1.6. P{X? > 3-84} = 5 per cent, 
or P{y2 > 3-84} = 5 per cent. 


Example A random sample of eleven independent measurements gave 
an unbiased estimate of s? equal to 9-65 mm?. Could the sample be 
reasonably regarded as drawn from a normal population with variance 
5-5 mm?? 


Solution The N.H. is that the variance σ is 5-5. From the given data, 
ns?/o2 = 11 x 9-65/5-5 = 19-30, and from tables of y? with 10 degrees 
of freedom, 

P{y? > 19-30} < 5 per cent. 


Hence the value found is significant and the null hypothesis is rejected. 
The sample is not regarded as drawn from a normal population with 
variance 5-5 mm?. 


16.7.4 Application of 7? to goodness of fit 

Earlier in this chapter methods of testing parameters were shown on 
the assumption that the distributions selected were correct. The aim 
now is to test the form of the distribution assumed. On the assumption 
that a certain distribution is correct, it is possible to evaluate expected 
frequencies and compare these with observed frequencies. Let Εἰ, 
i=1,2,3,...,k and O, i=1, 2, 3,..., & be the expected and 
observed frequencies for k possible outcomes of an experiment. Then 
the measure of discrepancy U say, between the observed and expected 
frequencies is chosen to be 

& (0; — E,)* 
ΕΣ τ 


If O and E differ, U will be large and it is necessary to have ἃ significance 
test for U. For large numbers of repeats of the experiment, the distribu- 
tion of U approaches a 7? distribution with k — 1 degrees of freedom. 


Example (i) A die is rolled 120 times. For an unbiased die each face 
occurs with the probability 4 so that each face is expected to occur 20 
times. The experiment gave the result in the following table which shows 
both observed and expected frequencies: 


Score 1 2 3 4 5 6 
Observed frequency | 15 18 30 16 24 17 


Expected frequency |20 20 20 20 20 20 
Are the data consistent with the expected frequency? 
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Solution The N.H. is that the die is unbiased. Then 


(O-E) (- 5} | (—2)? | 10? (-Φῷ 4. (-3)? 
DE Gt 


+--+ -Ἐ- ΠὶΞ-: 


20 20 20 20 20 


= 8:50, 


and there are 6 — 1 =-5 degrees of freedom. 

From tables, 72 = 11-07 at the 5 per cent level, and the observed 
value of 8-5 is less than 11-07 and the result is not significant. There is 
no reason to suspect the bias of the die. 


Example (ii) In a certain manufacturing process any one defective item 
occurs randomly in time and is independent of the time any other 
defective occurs. Records give the frequency of occurrence O, of x 
defects in one day as 


x 0 1 2 3 4 5 
O, 305 365 210 80 28 12 


Test the goodness of fit of a Poisson distribution whose mean is 1-2. 


Solution The N.H. is that the distribution is a Poisson distribution 
whose mean is 1:2. Expected frequencies E, are given by 


E, = [1 000 exp (—1-2)}(1-2)*/x!, 
and the values in the following table are calculated: 


x O; Ε, Ο-- Ε (0 -- ΕΞ (Ο -- E/E 
0 305 301:2 3:8 14-44 0-047 94 
I 365 361-4 3:6 12:96 0-035 86 
2 210 2168 —68 46:24 0:213 28 
3 80 867 --67 44:89 0-517 76 
4 28 260 20 400 0:153 84 
5 12 76 44 19-36 2°547 37 


Total 3-516 05 


Table 16.5 


Thus x? is 3.5 and the number of degrees of freedom is 6 — 1 or 5. 
From tables the 5 per cent level of 7? is 11-07 which is larger than the 
observed value of 3:5. Hence the result is not significant and there is 
no reason to reject the null hypothesis. That is, in this case, there is no 
reason to believe that the defects are occurring other than at random, 
and from this data there is no evidence of a fault in production. 


16.7.5 Modifications 


(i) When the population distribution is not known completely, and it 
is required to test the hypothesis that a sample is drawn from a specified 
distribution the parameters of which are not known, it is necessary to 
estimate the parameters using observed data from the sample. In this 
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case the resulting x? is tested as above but the number of degrees of 
freedom is reduced by the number of parameters estimated. That is, the 
number of degrees of freedom is given by 


(the number of classes) — 1 — (the number of parameters estimated). 


(ii) If the expected frequency Εἰ in any one class is less than five, it is 
necessary to combine class results as follows, for example, 


Class | 1 2 3 4 5 6 


οι 5 6 13 18 17 16 


Ει 2 3 15 20 17 15 


The first and second classes are combined to give 
Class | land2 3 4 5 6 


οι 11 13, 18 17 16 


Εἰ 5 15. 20 17 15 
The number of degrees of freedom is now 
5 — 1 — (the number of parameters estimated). 


16.7.6 Contingency tables or test of independence 

Another useful application of the y? test occurs when one sample is 
classified according to two factors, and it is required to know whether 
these factors are independent. It is usual to represent the data in a 
contingency table. 


Example A group of 143 students is tested in mathematics and physics 
and graded «, 8, y in each subject. Use the x? test to test the hypothesis 
that the grades in mathematics and physics are independent. The con- 
tingency table for this is given in Table 16.6. 

Mathematics Grades (B) 


ᾶ β γ Totals 

So « 33 13 5 51 
n 
3 «OB 6 49 il 66 
ab 
B30 «CY 2 3 21 26 
Ἔ; 
[5 

Totals | 41 65 37 143 

Table 16.6 


Before considering this particular example, consider a general con- 
tingency table containing r rows and s columns. Suppose factor A has 
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r levels A;, Az, Ag,. . ., A, and factor B has 5 levels By, Bo, Bs,. . ., B, 
and let there be m,, observations in class (A,, B,) and the contingency 
table is shown in Table 16.7. 


8. 8, B; «4.20 8, Peery B, Totals 
Ay Ay πιὰ N13 eae ἂ Nyy oe Mrs n= x πὰ, 
͵ 
Ag 21 Nog Nag ee Noy ὡρῶν ῦ Nes Ng. = x Noy 
I 
44 N31 N3a N33 teks Nay wee: Nga n3. = x Noy 
ὃ 2 
A, Ny Nig nig es ees ny oa Oe Nis nn, = x ny 
. 7 
4, Mn Ma the owe OTP es πὶ np = Diy 
͵ 
Totals ny Ng Ng ny Nos 
= = = wale: os = see = n 
x ny x he x Ng Σ ny = Ns 
{ 1 1 1 t 
Table 16.7 


On the null hypothesis that the two factors A and B are independent, 
the probability that an observation falls in class (A, B,) is the product 
of the probability that it falls in A, and the probability that it falls in 
By, or pis = (p;. p.3). Hence expected frequencies 

Eis = mpi = Mi. Ps 
and 77 =  Σ(Ο — ΕΥΕ 
=> (πῃ — mpi. p-s)? 
tJ MP1. Ps 
with r x s — 1 degrees of freedom. 
In general p,. and p., are unknown parameters and are estimated by 


P;. = proportion of observations in A, 
= n,/n, 
p.; = proportion of observations in B, 
= n.,/n, 
where 7;. = > ny and n., = 2 ny, and finally 
3 


Y= ΣΣ (ay — ἥν n,n)? (10) 


n,. ποία 
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Since Σ Ps = 1 and > p-+ = 1, the total number of degrees of 


freedom i is reduced by one “for each parameter estimated so that there 
arer — 1 +s — 1 parameters estimated. Hence the number of degrees 
of freedom for the contingency table is 


r—-1—-(r-—-1)-G-)D)=C—VDG6-— 1). 
Hence testing independence in Table 16.6, 
E(Gp%&m) = 51 x 41/143 = 14-62, 
E(¢pBm) = 51 x 65/143 = 23-18, 
E(apym) = 51 X 37/143 = 13-20, etc. 
where a, means a grade « in physics, etc., and 


(33 — 14-62)? (13 — 23-18)? (6 — 13-20)? 


i ="—462 ἐς 3348 ++ 13-20 
(21 — 6-728)? 
6728 
= 106-52. 


Since y? = 9-49 for (3 — 1)(3 — 1) = 4 degrees of freedom at the 5 per 
cent level, this result is highly significant and the hypothesis of inde- 
pendence is rejected. 


Special case of 2 Χ 2 contingency table 


The 4? test is an approximation which improves as the number of cells 
in the contingency table increases. In the special case of 2 x 2 tables 
(i.e. r = 5 = 2) the approximation is improved by using Yate’s continu- 
ity correction. The conventional notation for a 2 x 2 contingency table 
is shown in Table 16.8. 


Characteristic B 


x I It Totals 
2 
κ᾿" 
- 
8 1 a b n 
9 1 
a 
Ξ 2 c d Ng 
ῳ 
Totals ΝΣ te ἝΞ 


Table 16.8 
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To apply Yates’ correction, decrease by } those O > E and increase 
by } those O < E and suppose the corrected table is 


a’ b’ ny, 
c d’ Nog 
a ἡ 
Then it may be shown that an equivalent expression for y? is 
n(a'd’' — b'c’)? 


AYNgNgN 


2 


Example It is thought that people who wear spectacles for reading have 
jobs which involve much reading. Two hundred people were investi- 
gated and the following table records the result. 


Ss Non-S Totals 
R 74 31 105 
Non-R | 46 49 95 
Totals | 120 80 200 


Test the independence of the two factors. 


Solution The N.H. is that the two factors are independent. 

E(R, S) = 105 x 120/200 = 63, O> E, 

hence decrease 74 by }. 
E(R, Non-S) = 105 x 80/200 = 42, O<E, 

hence increase 31 by 4. 
E (Non-R, S) = 95 x 120/200 = 57, O<E, 

hence increase 46 by }. 
E(Non-R, Non-S) = 95 x 80/200 = 38, O>E, 

hence decrease 49 by 4. 


The corrected table is 


Ss Non-S Totals 
R 73-5 31-5 105 
Non-R | 46:5 48-5 95 


Totals | 120 80 200 
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From the table, 


(7355 x 48-5 — 465 x 31:5) 
20a SE a eo ee pe oe ἀπὲ 
re a ROL IOS ROS  ὼ 


= 9-219. 


From tables of y?, the 5 per cent level of 7? with one degree of freedom 
is 3-84. 

The observed value of x? is 9-219 which is greater than 3-84, and the 
result is highly significant and there is no reason to suppose indepen- 
dence of the two factors in this group of people. 


16.7.7 Test of homogeneity 


The x? test may be used to determine whether several samples are 
drawn from the same population or populations having the same 
distribution. The precise nature of the distribution is unimportant. 

Suppose s different samples are drawn and a given characteristic 
observed for each one, and let there be r classes of this characteristic. Let 
my be the frequency with which the ith class of the characteristic 
occurs in the jth sample and the results be recorded in a table of the 
same form as Table 16.7, where B, is the jth sample and 4, is the ith 
class of characteristics. To test the null hypothesis that the s samples 
were drawn from populations having the same distribution, the method 
of evaluating χϑ is exactly the same as that of section 16.7.6 even 
though the mathematical model is different in that the column totals 
are not random but are the stated sample sizes. The number of degrees 
of freedom is now (r — 1)(s — 1) also. For, referring to the notation 
of Table 16.7, for the first sample, 


Lasy = > (My — πὰ Ρυ [πὰ Pi 
1st sample 


or (r — 1) degrees of freedom. 


For the jth sample, 


Xam sample) — > (143 — πὸ Pi)? πο Pi 
jth sample 


with (r — 1) degrees of freedom. 


Summing over all j = 1, 2, 3,.. ., 5, 


σιν -- n.; pi)” 
2 = = 
2 Σ Σ Ne; Pi 


with s(r — 1) degrees of freedom. 
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If now p,, i = 1, 2, 3,. . ., ris estimated by the corresponding relative 
frequency n,./n, 


(14 — πὸ, n/n)? 
2 δῶρο πο Ὡς 
χ Σ Σ ny n/n 


with s(r — 1) — (ἡ — 1) degrees of freedom which is (r — 1) — 1) 
degrees of freedom. 
Hence the procedure is: 


(i) find the expected frequencies E,, = n., n,./n, 
(ii) calculate y?, and 
(iii) refer to the y? distribution with (r — 1)(s — 1) degrees of freedom. 


Example The distributions of breaking tension levels of three different 
substances are as follows: 


| Sub I Sub II Sub III | Totals 


Tr 140 161 133 434 
Ta 79 63 82 224 
Ts 31 36 25 92 
Totals 250 260 240 750 


Is the distribution of breaking tension level the same for the three 
substances? 


Solution The N.H. is that the breaking tension level is the same, i.e. 
all substances are from populations with the same distribution. The 
expected values are, 


ΕΑ, T,) = 250 x 434/750 
= 144-67, 

EC, T2) = 74-67, 

EC, ΤΆ) = 30-67 etc. 


and 
ee 4-67? 4-332 0-33? 4-442 
= 44-67 + 7467 + 3067 +--+ + 5524 
= 8-279, 


The number of degrees of freedom is given by (3 — 1)(3 — 1) = 4. 
From tables, 7? is 9-49 which is greater than the observed value of 8-28, 
and hence the result is not significant at the 5 per cent level. From the 
given data there is no reason to doubt the null hypothesis. 
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EXERCISE 16 


1 


If x is normally distributed with mean μ = 10 and variance o? = 9, 
calculate the probability that (i) x > 11, and (ii) if a random sample 
of size 16 has mean x, that x > 11. 


A machine produces circular parts whose diameters are known to 
be normally distributed with standard deviation 0-019 πὶ and the 
mean diameter is supposed to be 0-500 m. A random sample of ten 
is taken and the sample mean is found to be 0-488 m. Does the data 
from the sample indicate a change should be made in the machine? 


3 The mean range of a certain type of gun is 1 000 metre and the 


range standard deviation is 124 metre. After two years’ storage 
fifty rounds are fired and the mean range is found to be 872 metre. 
Test whether storage has changed the mean range. 


4 A firm claims that 88 per cent of the steel rods they produce have 


a breaking tension of more than 1000 newton. A sample of 300 
rods was tested and only 249 rods had this breaking tension. Does 
this discredit the firm’s claim? 


5 The diameters of a set of ball bearings are assumed to be normally 


distributed with a standard deviation of 0-1 cm. What size sample 
should be taken in order that the estimate of the mean diameter of 
the ball bearings shall differ from the population mean by more 
than 0-005 cm only once in 100 times ?. 


6 In a certain county in one year 90 boys entered school A and 90 


boys entered school B, after the results of tests in Mathematics and 
English. The mean of the marks of the boys entering school A was 
63 and that of the boys entering school B was 60. If the variance 
was 144 in both cases, is the claim that the entry requirements for 
both schools is the same, justified by these data? After one year the 
same boys were each given another common examination. In school 
A, 10 boys had left the school, so that 80 boys had an average mark 
of 61:5 with variance 900 and the 90 boys of school B had an 
average mark of 65 with variance 576. Can any conclusion be drawn 
about the superiority of school B? 


7 Two researchers X and Y each tested the growth of a sample of 


fifty plants of the same species over a fixed period of time. The 
following table shows their results: 


Xx Y 
Mean growth in cm 18-04 19-38 
Variance in cm? 8:85 9:12 


Does the smaller variance for X imply he is a better researcher than 
Y? Determine whether the difference in the means is significant 
(i) at the 5 per cent level, (ii) at the 1 per cent level. 
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8 Two classes take the same test. Class A consists of six students and 
class B consists of eight students. The marks awarded are 
Class A | 830 829 827 828 826 826 
Class B | 828 823 824 827 824 826 824 823 


Test whether the mean marks of classes A and B are significantly 
different. 


9 Tests showed that of 100 women drivers and 200 men drivers, 30 
of the women and 75 of the men were not good drivers in that they 
failed the tests. Use these data to test the claim that women are not 
such good drivers as men. 


10 A sleeping drug and a harmless control tablet were tested in turn on 
nine patients in hospital and the number of hours’ sleep recorded 


as follows: 
Patient number 1 2 3 4 5 6 7 8 9 
Under sleeping drug 9:5 7:7 9-1 5:3 6710-4 78 9:9 8-7 
Under control drug 75 74 95 50 5.3 90 7:2 81 8&9 


Is the sleeping drug effective? 


11 Nine students were tested before and after being coached in mathe- 
matics and their marks were 


Student number 1 2 3 4 5 6 7 8 9 
Before coaching 55 61 63 47 76 45 48 60 62 


After coaching 62 64 62 51 73 50 54 56 63 
Do these data show that the coaching was effective? 


12 The number of absences due to sickness in two factories over a 
period of six months was recorded as a percentage with the following 


result: 
Month Oct. Nov. Dec. Jan. Feb. Mar. 
Factory A 39 48 57 63 71 78 
Factory B 45 51 66 74 82 73 


Use the t-test to examine whether there is any significant difference 
in these two sets of data. 


13 Two machines are used to make identical rods. Random samples of 
ten parts from machine A and seven from machine B were measured, 
and the results recorded. 

Machine 4 (mm) | 11-1 11-5 11-4 11-0 109 111 112 11-5 113 114 
Machine B(mm) | 11:2 10-6 11-1 103 118 109 11-7 

Do these data indicate a significant difference in the mean lengths 
of the rods from the two machines? 


14 Capacitors of nominal value 100 pF are being produced and over 
a large number 15 per cent are rejected as being not of the required 
standard. Samples of 5 are taken at regular intervals and tested. 
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(i) Use the binomial expansion to calculate to three decimal places 
the probabilities of there being 0, 1, 2, 3, 4, 5 rejects in a sample. 
(ii) Calculate the probabilities of the various numbers of rejects 
using the Poisson approximation. (iii) Ten consecutive samples each 
of five capacitors contained 1, 2, 3, 0, 2, 3, 1, 3, 2, 2 rejects respec- 
tively. Use the probabilities calculated in (i) and (ii) to calculate 
two different values of y? to test whether the apparent increase in 
the percentage defective is significant. 


15 Test the goodness of fit of the following data from 500 trials of an 
experiment to a Poisson distribution whose estimated mean is 1-3. 
x 0 1 2 3 4 5 6 
Observed frequency 132 180 121 51 10 5 2 
16 The cost per meal in canteens of different size, serving different 
numbers of meals per day, is listed in the following table. Do these 
data support the belief that larger canteens produce cheaper meals? 


Cost of meal in new pence 
Number of meals 
served under 30 30-40 40-50 over 50; Totals 
100-499 49 84 74 26 233 
500-999 40 77 132 59 308 
Over 999 43 76 140 103 362 
Totals 132 237 346 188 903 


17 Test the y? goodness of fit of the distribution of problem 18, 
Exercise 15. 


18 Manual dexterity and success in theoretical examinations were 
graded over a period of years in applied science students. The 
following data were recorded. Test whether manual dexterity and 
success in examinations are independently distributed. 


Manual dexterity 


Examination α β γ ὃ ω 
a 1369 171 1 042 399 2 
B 2 578 475 2 704 933 12 
γ 1.391] 421 3 827 1 843 36 
ὃ 455 256 1 849 2 507 113 


17 
Applications 


17.0 INTRODUCTION 


In this chapter a number of physical problems are considered and 
mathematical equations or statements which attempt to describe the 
physical situation are derived. 

The aim is to indicate some of the ways in which the mathematics 
of the previous Chapters may feature as part of an engineering type 
problem. To take an example, the hyperbolic functions were introduced, 
by definition, in section 4.5 and their relationships with the trigono- 
metric functions shown in section 10.6 but the engineering student still 
may not appreciate the need to study such functions until some prac- 
tical use is described. Two illustrations of the way in which the hyper- 
bolic functions may feature are given in section 17.2.2 where they appear 
as the solution of a differential equation obtained from the model of 
a real situation. 

When an engineer is confronted with a problem it frequently happens 
that any attempt to describe the complete situation in mathematical 
terms leads to equations of too great a complexity. In this case the way 
forward is to replace the physical problem with a simpler model in 
which some of the variables have been ignored. The engineer may often 
be called on to use his skill and ‘know-how’ to decide which variables 
should be retained and which omitted. The illustrations of the following 
sections may be looked upon as simple models of a more complicated 
real-life situation. For example, the spring systems of section 17.3.3 are 
idealised in that, first the damping/frictional effects are not taken into 
account, and second, the stiffness of each spring is presumed constant 
whereas it is possible for the stiffness to depend on temperature and 
therefore on the rate at which the displacements take place. These 
additional factors (and others) when taken into account lead to 
mathematical equations which would be excessively complicated and 
would obscure the basic fact that the eigenvalues can be related to the 
natural frequencies of oscillation. With this fact well-established the 
equations could be modified to take the additional factors into 
account. 

From a mathematical standpoint the solution of the mathematical 
equations is the solution of the problem, but from an engineer’s stand- 
point this solution is only one step towards solving the physical 
problem. The engineer must be able to interpret the mathematical 
solution in terms of the physical variables remembering that the 
mathematical one may have been derived from what was only a model 
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of the real situation. The mathematical solution may sometimes be 
used to assess the justification of the omission of certain variables. 

The problems considered here are simply a selection from a much 
wider range of problems of the type that the engineer could encounter 
both in his academic course and in practice. 


17.1 INEQUALITIES. LINEAR PROGRAMMING 


The engineering world is concerned very much with inequalities. It is rare 
that a measurement is precise, generally a variable is found to lie 
within a band of values and the symbols < or > are of frequent 
occurrence. The student should know the difference between equality 
and inequality and in certain circumstances this may more readily be 
observed in a graphical sense. For example the equation 


2y + 3x = 6 


may be represented in (x, y) space as a straight line of slope (-- ὃ) 
having intersects of lengths 2 and 3 on the x and y axes, respectively 
(see Fig. 17.1). 


Fig. 17.1 


The equation 
2y + 3x <6 


however, represents a region of the (x, y) space and is shown by the 
unshaded region of Fig. 17.2. 
Likewise the shaded region is given by 2y + 3x > 6. 

A simple use of such graphical regions to represent inequalities 
equations and their use in solving a management problem is now given. 


Example A factory is to be designed to accommodate two types of 
machines to make a particular product and the manager has to 
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2y+3x>6 


Fig. 17.2 


decide how many of each type to install. The specification is as 
follows: 


Floor area Operators Profit units 
Type A 4000 11 9 
Type B 7000 8 12 


The maximum factory area available is 56,000 and there are 88 
operators available to make the product on either machine. The factory 
must make a profit of not less than 90. How many of each type of 
machines should be used 

(a) to obtain the maximum profit? 

(b) to make the minimum number of redundancies? 


Solution Suppose x machines of type A and y machines of type B are 
installed then the following inequalities arise: 


Area 4x + Ty < 56 
Employment llx + ὃν < 88 
x>0,y 20 


If P is the profit then 
P=9x + 12y > 90. 


Representing the various inequalities in (x, y) space and shading 
the unacceptable regions leads to Fig. 17.3. The unshaded region is the 
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“operational zone’ and there are five operational ‘points’ within the 
zone, i.e. (1, 7), (2, 6), (3, 6), (4, 5), (5, 4) 


f+ ann ὦ 


S 


(a) the line 9x + 12y = P is parallel to 9x + 12y = 90 and the 
profit P will be maximum when the perpendicular distance from the 
origin to the line 

P=9x + 12y 


is greatest. Naturally it is necessary for the line to pass through one 
of the operational points (x, y must be integers) and Fig. 17.3 shows that 
the line must pass through the point (3, 6). Under this condition 


x=3,y=6 

Area = 54,000 

Employed persons = 33 + 48 = 81 
Profit = 27 + 72 = 99 


There are thus, 2,000 units of area not used and 7 operators cannot be 
employed. 
(b) The minimum number of redundancies will appear when the line 


P = 9x + 12y 


parallel to 9x + 12y = 90 in the operational zone, passes through the 
operational point nearest to the line 11x + ὃν = 88. 
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From Fig. 17.3 this is observed to be the point (5, 4). Under these 
conditions 


x=5,y=4 

Area = 48,000 
Employed persons = 87, 
Profit = 93 


Thus only one person is declared redundant (or is promoted) but 8,000 
units of area are unused and the profit is reduced to 93. 

Other possibilities appear on examining the Figure. One obvious 
possibility would be to operate at the point (5,5) within the profit/ 
area zone but outside the labour zone. This leads to 


Area = 55,000 
Operators = 95 
Profit = 105 


This operational point would utilise practically all the area available, 
give a larger profit but would require the recruitment of 7 more machine 
operators. Whether this is a possibility would depend on other factors 
outwith the present consideration which may in turn lead to a further 
problem of this type. 

This type of problem which leads to a mathematical model as a set of 
m linear equations with π΄ unknowns together with a target function 
which requires maximising or minimising is called a linear programming 
problem. When the number of unknowns is greater than 2 the graphical 
approach is impossible (with 3 unknowns the linear equations are 
represented by planes in 3-dimensional space and with 4 unknowns 
there is no graphical representation at all) and it is necessary to resort 
to other techniques. There are many techniques available with the 
general heading of linear programming and they usually start by setting- 
up the mathematical equation in matrix form. If a maximisation of some 
function is required, this is often achieved by iteration i.e. start with an 
initial estimate of the variables and then use some process repetitively 
to improve the estimate until eventually the optimal values are reached. 

The applications of linear programming in engineering are many 
and varied. A distribution problem may be such that different sources 
Sy, 5.2, S3,. . hold stocks P;, Py, Ps. . .and supply a number of 
different consumers C,, C2, C3. . . whose demands are ἢ, Do, D3... .- 
In this case the cost supplying consumer C; from source S; may be 
written A,; and the matrix (A;,) is the cost matrix. The problem is to 
meet the demands of the consumers in the most economical way, i.e. 
to keep costs to a minimum. Such distribution type problems may be 
met where there is supply and demand of materials (raw or manu- 
factured) or services (electricity, gas, water). 
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17.2 EXPONENTIAL FUNCTION. HYPERBOLIC FUNCTIONS. 
DIFFERENTIAL EQUATIONS 


The exponential function e** was introduced in Chapter 4 as the solution 
of a particular differential equation 


dy 
ee 


This is an important differential equation which makes its appearance 
in many different contexts. 


17.2.1 Natural law of growth. The exponential function 


A variable is said to satisfy this law of growth if its rate of change is 
Proportional to the value of the variable, i.e. if y is the variable, then 


dy 
a“ 
Some simple ways in which this law appears will be illustrated. 

(i) Under certain conditions, populations of animals, people, or 
bacteria satisfy this law, i.e. that the rate of change of a population is pro- 
portional to the population. With a positive constant of proportionality 
kK the population is then seen to have exponential growth y = A e*t, 
where A is constant. The shape of the curve of γ is similar to the shape 
of Fig. 4.1 and increases rapidly for t > 0. If this represents the behav- 
iour of the world human population it gives good indication of the 
rapidity of the increase of population with time. Fortunately, there are 
indications that the growth law of human population is governed by 
an equation of the form dy/dt = ky(c — y), where ὁ is a constant. This 
differential equation may be solved using the method of section 11.1.2 
and has the solution 


y = Ac|(A + ς “κἢ 
where A is an arbitrary constant. In this case the population has a 
limiting value given by 
Limit [Ac/(A + εἰ 595] 
to 
= Ac/A=c. 

This population growth case is illustrated in Fig. 17.4. 

(ii) If a hot body at temperature T is allowed to cool towards the 
constant temperature 70 of its surroundings, the law of cooling (as 


propounded by Newton) states that the rate of change of temperature 
is proportional to the temperature excess, i.e. 


aT 
ad a (T — Tp). 
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y 


Fig. 17.4 


If this is a cooling situation then the rate of change is negative and 


ΩΤ 
dt = —k(T — Ty) 
where k is a positive constant dependent on the material of the body and 
its surrounding. Replacing T — Ty by @ then 
do 
τ Ξ —ké 
leads to θ = T — Ty) = Ae~*, 
The cooling curve for a body of initial temperature 7, is illustrated 
in Fig. 17.5. 


Fig. 17.5 
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(ili) Suppose a rope is passed round a rough circular bollard and 
forces applied at each end of the rope. The tension at any point of the 
rope may be evaluated as follows. Refer to Fig. 17.6 and consider the 


Fig. 17.6 


equilibrium of forces acting on the small element AB of the rope. The 
forces acting are T at point A, T + 67 at point B, overall reaction of 
the bollard R, and frictional force F. Hence for the small element of 
rope AB 

(T + OT) sin (360) + T sin (460) = R 


(T + OT) cos (460) ~ T cos (466) = F. 


In the case of limiting friction (i.e. where the rope is about to slip) 
then F = wR where μ is the coefficient of friction between rope and 
bollard. 

Replacing the trigonometric functions by the first term of their 
Maclaurin expansions, i.e. sin 360 = 366, cos 466 = 1, and neglecting 
products of small quantities leads to 


OT = Fu (2T + 6T)(466) 
Ξε μτὸθ 
so that 67/60 = μΤ and, in the limit as 60, 6T --» 0, 


ΘΕ. 
ao. “΄' 


This is the natural law of growth and so the solution is T = A οἱ 
Now, T = Po when 6 = 0, and T = P, when 6 = 4, and hence 


Py = Pye". 
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If two turns are taken around the bollard then ¢ = 4π and, if the coeffi- 
cient of friction is taken as 4, then ῥ᾽ = Py εἰ" = Pye”, i.e. Py = 23Pp. 
Thus for 2 turns on the bollard a pull at one end will sustain a tension 
23 times as large at the other. This is the basis for the ship’s capstan 
and the band-brake. 


17.2.2 The hyperbolic function 


The hyperbolic functions were introduced in Section 4.5 of Chapter 4 
and used in deriving the solution of some differential equations in 
Chapter 6, the relationship between the trigonometric and the hyper- 
bolic function was shown in Section 10.6 and it is because of the con- 
nection with the well-known sine and cosine that the basic hyperbolic 
functions are called hyperbolic sine and hyperbolic cosine. The con- 
nection with the hyperbola is not quite so obvious. 

The curve of intersection of a right-circular cone with a plane is 
called a conic section. The three basic sections are the ellipse, the para- 
bola, and the hyperbola. In Fig. 17.7, the three types of intersection 
are shown: 


aN 
ἷ = =e 


(a) (b) (c) 
Fig. 17.7 


Fig. 17.7(a) shows the section when the plane intersects the cone in a 
closed curve. This section is called the ellipse. Fig. 17.7(b) shows the 
situation when the plane is parallel to the side of the cone giving an 
open curve called the parabola. The Fig. 17.7(c) shows the intersection 
curve that has two branches which together form the hyperbola. 
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It is shown in books of analytical geometry that the ellipse, referred 
to its symmetrical axes (see Fig. 17.8(a)), has an equation 


Fig. 17.8 


whilst the hyperbola, Fig. 17.8(b), has the equation 
Χ _y 


at: 03 
The two lines AB and CD towards which the hyperbola approaches at 
infinity are called asymptotes and have equations y = +(b/a)x. 

When ὁ = a, the ellipse has the equation x? + y? = a? and the 
particular symmetric shape is a circle. The equivalent hyperbola (with 
b =a) has the equation x? — y? = a? and is called a rectangular 
hyperbola (i.e. one whose asymptotes, of equations y = + x, cut at 
right angles). 

It is well known that the circle can be represented by its parametric 
equations x = acos 0, y = asin 9, since cos?0 + sin?@ = 1, and for 
this reason the trigonometric functions sin 6 and cos θ are sometimes 
called the circular functions. In like fashion the parametric equations of 
the rectangular hyperbola are x =acosh¢, γ =asinh¢, since 
cosh*f — sinh?f = 1. The two functions sinh¢ and cosh¢ are then 
called Ayperbolic functions. 

The hyperbolic functions are obviously important from a geometrical 
and mathematical viewpoint but they are important functions that also 
occur in a natural way as the following illustrations show. 


2 


=I. 


(i) The catenary 


If a heavy chain is suspended under gravity between two points, the 
curve that the chain forms is called a catenary, e.g. overhead electricity 
cable, or telegraph wire. 
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T+ 87 


O x 
Fig. 17.9 


Let s be a distance measurement along the chain and consider the 
equilibrium of a small portion ds, 1.6. AB of the chain. The three forces 
acting are the weight ods, where o is the weight per unit length, and 7, 
and T + OT, the tensions at the ends A and B. Resolving forces on 
AB, horizontally and vertically, 
(T + dT) cos(6 + 66) — Tcosé = 0, 
(T + 6T) sin(6 + 66) — Tsin6 = ods 
where tan @ is the slope of the catenary at A. Expanding sin(@ + 60) 
and cos(6 + 66) and using 
sin 060 = 66 
cos 66 = 1 
these two equations reduce to 
oT οοϑθ — T sin@ 60 = 0 
oT sin6 + T cos6 60 = ods 


after neglecting products of small quantities. 
Proceeding to the limit as 60, 67-0 leads to two differential 
equations 


aT 
mo T tan0 


. ΩΤ ds 
sind 7 = —Tcosé + oH 


The first of these differential equations can be integrated immediately 
(see Section 11.1.2) since the variable are separable and gives 


T cos6 = constant = 70 (say) 
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Elimination of dT/d6 and T in the second differential equation leads to 


d6 


σ 
Sees ταν ὁ 
τ τ. 


This differential equation can be expressed in terms of x and γ (see 
Fig. 17.9) by using the fact that 
dy/dx = tan6 


from which it follows that 


d*y on “ἶ 
ia ae (tan) = sec OF 
dO ds 
— 2)! a πολυ δι 
αὐτὸ Tae 
τωρ (2 coc2q) “δ, 
= sec*6 (F cos 0) 7: 
d’y _ 9 ds 
a dx? 7) dx 


Writing σίτῳ = 1/c and using 
ds|dx = βεοθ = V1 + tan?0 = V1 + (dy/dx)? 
the differential equation for y becomes 


dy 1 
= svi + Olde. 


This may be solved by putting p = dy/dx, as in Section 11.2.1, i.e. 


dp 1 5 
ἂχ ς VIi+p 
1.6. a : fax 
V1 + pr “ 
Thus sinh71(p) = x/e + A 
or p = sinh (4 + x/e). 


Suppose that the y-axis of Fig. 17.9 is now made to pass through the 
lowest point of the catenary then x = 0 when p = dy/dx = 0, i.e. 
A = 0 and hence 


p = dy/dx = sinh (x/c). 
Integrating again, 
y = ccosh (x/c) + B 
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Locate the x-axis at a depth c below the lowest point of the catenary, 
ie. y=c, x = 0, then B=0. Hence y = ccosh x/c is the equation 
of the catenary curve with axes as shown in Fig. 17.10. 


y =c cosh (a/c) 
C= τσ 


Fig. 17.10 


Thus the catenary is the shape of the hyperbolic cosine curve. This shape 
is frequently seen in practice, e.g. telegraph wires, electricity power 
cables. 

The length s of the curve may be found by integration. From 


it follows that 


= [ft + (sin =) dx = Joost (x/c) dx. 


Hence s = c sinh x/c + D. 

Measuring s from the lowest point, s = 0 when x = 0 hence D = 0 
and s = ὁ sinh x/c. 

Thus the length of the catenary y = cosh x/c from its lowest point is 
the hyperbolic sine function s = ὁ sinh x/c. 


(ii) The cooling fin 

In many physical situations it is necessary to extract the heat from a 
hot liquid (or gas), e.g. a heat exchanger, a cooling system. In the 
simplest situation the hot container simply radiates heat into the cold 
surroundings (the latter being maintained by a flow of cold air or other 
medium). The container is frequently a metal pipe and in order to 
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increase its surface area it is provided with cooling fins. A typical pipe 
is shown in Fig. 17.11. 


Fig, 17.11 


The cooling fin may take a variety of different shapes the cross-section 
being rectangular, triangular, or parabolic. Consider the case of a fin 
of constant rectangular cross section and the flow of heat through it. 
Fig. 17.12 shows the typical cross-section. Let T be the temperature at 


Fig, 17.12 


distance x along the fin and g, be the quantity of heat flowing over the 
cross-section of area A, let g, + 6g, be the heat flowing over the cross- 
section of the fin at distance x + 6x and let dg, be the heat flowing into 
the cooling medium from the section of thickness ὄχ. Let / be the length 
of the fin. The heat balance equation over the section of width 6x at 
distance x is 


Qn = (Gn + OGn + δα) 
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Now, g, is a function of x only and hence (see equation (7) of section 
5.1.3), 


ἐφ, 
δα, ΞΞ ae: ὃ 
and it follows that 
dn 
0Ge ΞΞ - We ὃ 


There are two basic empirical laws of heat transfer: 
(i) the law of conduction which states 


_ κ 4 
qn = τὰ Ox 
where K is a constant dependent on the thermal properties of the fin 
material and, 
(ii) the law of convection (Newton’s law of cooling), which states 


bq. = WT — Τὸ (Cox) 


where / is a constant dependent on the surface properties of the fin and 
on the cooling medium; T, is the (presumed constant) temperature of 
the cooling medium and C is the circumference of the fin at distance x. 


dq, d { dT 
Thus, A(T — T,) Cx = — ae ὃχ = ἜΚ (4 =) Ox. 
Normally A and C are dependent on the distance x but if the fin is a 
rectangular one they are both constants and in the limit, as 6x — 0, 
the equation becomes 


ΩΤ hc 
de = KAT — Te) 


The assumption that 7, is constant implies that the heat is transferred 
to the surrounding medium and immediately transported elsewhere. 
In practice something akin to this idealised situation is achieved by a 
rapid flow of cooling medium over a hot fin. For example, the sump in 
certain motor cars is often of ribbed shape approximating to a series 
of fins and the surrounding cooling air is usually moving past the fin 
rapidly. 

It is convenient to replace T — T, by 6 and hence reduce the equation 


to 
430 
—— = [2 
715" Κκθ 


where k = 4/(AC/K αὶ is a constant. 
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The differential equation is easily solved (see Section 11.4 with 
A, μι = +k, —k) and θ is given by 


6= De*  Εε 


where D and Εἰ ἅτε arbitrary constants of integration. 

In order to evaluate these arbitrary constants it is necessary to fit 
boundary conditions to the problem. If the temperature T at the base of 
the fin is taken as 70 then one boundary condition has the form 


θ- Τὶ —T, = %, atx =0. 
A second boundary condition may be obtained by neglecting the heat 
loss at the extremity of the fin x = /, ie. by assuming that (4,):-ἰ 15 
zero. Use of the law of conduction (see (i) above) gives the second 
boundary condition in the form 


aT 6 
ΤΣ or FO atx =), 
Insertion of these two boundary conditions into the equation for 0 gives 
6=D+E 


0 = Det! — Ee-*! 
Solving for D and £ and using the fact that 
e? + εἰ = 2coshd 


quickly leads to 
6 = 6) cosh k(/ — x)/cosh kl 


1.6, T — T, = (Τὸ — Τῷ cosh Κ( — x)/cosh kl. 


The variation of temperature across the fin length thus follows an 
hyperbolic cosine curve as in Fig. 17.13. 


Fig. 17.13 
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Fins of shapes other than rectangular may be considered in a similar 
manner, the differential equations so arising being of additional 
complexity. It may be shown that the ideal shape (for optimum heat 
transfer) is of a parabolic cross-section and this is the shape frequently 
seen on practical fins. 


17.3 MATRICES. DETERMINANTS. EIGENVALUES AND 
EIGENVECTORS 


Engineering situations, when described by a mathematical model, 
frequently lead to a set of algebraical or differential equations in a 
specified number of unknown quantities. Matrix algebra may often be 
used to simplify the sometimes complex array of information. 


17.3.1 A simple electrical network 


Suppose an electrical network consists of a number of resistors together 
with batteries of given electromotive force (e.m.f.). It is required to 
find the current flowing in the various branches of the network. The 
Fig. 17.14 illustrates such a circuit the resistor values are in ohms and 
the e.m.f. in volts. 


Fig. 17.14 


Let the current (in amperes) be represented by cyclic currents 4, 18, 
and 75. flowing in the three branches of the circuit, e.g. the current 
flowing in the resistor of 5 ohms is (ἢ — J). 

The basic Kirchoff laws that govern the network are: 

(i) for each closed circuit the total, algebraic sum of e.m.f. is zero, 

and 

(ii) at each nodal point there is continuity of current. 
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The law (ii) is automatically satisfied by using cyclic currents, for 
example at modal point A the current flow is 


which shows that the current flow into the node is exactly balanced by 
the current flow out. 
The law (i) leads to the following set of algebraical equations 


10 = 6, — 1) + 5, — 2) + 31 
3 = 71. + 5(1. — h) + 2g — Is) 
5 = 2U3 — 1) + 6s — 1) + 1025 
This set of equations reduces to the following 
147, — 512 — 6/, = 10 
—5h, + 14%, — 21; = 3 
—6f, — 21. + 184, = 


14 -5 —6\ (ἢ 10 
—S 14 —2}{(L) =| 3 
—6 --2 18/ Vs 5 


This is of the form 


or in matrix form 


RI=E 


where R is the resistance matrix, 
115 the current matrix, 


and E is the emf matrix. 


Note that the matrix equation E = RI is the generalised form of the 
the first Kirchoff law. 
To find the current, the matrix equation is solved for 7 either by using 
the inverse matrix R7+ so that 
I=RUE 
or by employing Gaussian elimination. 


17.3.2 Forces in a frame network 


Suppose Fig. 17.15 represents a simple framework structure with 
hinged joints. It is assumed that there is no bending in any member 
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and that the external forces are applied at the joints. It is required to 
find the reactions at the supports and the bar tension (or compression) 
forces. 

Framework of this type are of common occurrence in the analysis 
of structures (e.g. girder bridges, cranes, skeleton of buildings) and are 
usually called trusses. 


Fig. 17.15 


In the framework of Fig. 17.15 there is an external force applied at 
P having a component 50kN acting in the negative z-direction and 
another component 100KN in the y-direction. The other numbers 
represent the lengths in metres and the bar members of the truss are 
labelled (1), (2) and (3). 

There are three reactions at each of the supports and each reaction 
has three components in the x, y and z-directions. In addition there are 
three tension forces along the three bars. In all this gives twelve un- 
known forces (3 x 3 + 3), and the appropriate number of equations 
is obtained by balancing force components at each joint giving 4 x 3 = 
12 equations. 

Let UV in Fig. 17.6 represent a typical bar member with an (assumed) 
tensile force F (i.e. internal bar force directed away from the joint). Let 
the forces have components (P;P2P3) and (R,R2Ra) respectively. 
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Fig. 17.16 


Balancing the forces at joint U gives 
Py + Foose =0 
P,+ Fcoosp =0 
P, + Fcosy = 0 


where (cos «, cos 8, cos y) are the direction cosines of the line UV. In 
structural engineering it is usual to work in terms of F// where / is the 
length of the bar UV and this ratio, denoted by T, is called a tension 
coefficient. Thus at joint U the force balance equations become 


Pi +Ta=0 
P3+ Tc =0 


where a, ὁ, and c are the projections of UV onto the axes of x, y, and z 
respectively. The actual bar force F is then found by multiplying T by 
the length of the bar. 

The Fig. 17.17(a), (b) show the plan and elevation force diagram for 
the truss of Fig. 17.15. The tension coefficients are 7,, 7. and 7. and 
the reaction force components are (P;P2P3) at A; (Κι R23) at B, and 
(5:.5..53) at C respectively. The external forces are indicated and the 
other numbers on the Fig. 17.17 indicate the lengths. 

Balancing forces at each joint in turn leads to the following equations: 


at A, P, —5T, =0 
P10 = 0 
P; + 307, = 0, 
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at B, R, + 107, = 0 
R, + =0 
Rg + 301; = 0, 
at C, S, + 107; =0 
δ. -- 107; = 0 
Sz + 30T, = 0, 
at P, 5T, — 10T, — 10T; = 0 


—50 = 307, = 307. — 3073 = 0, 


Poors 


— 
100 ΚΝ 


Lg 
x y 
Ἢ 93 
Se 
A ---- C 
Po Ro 
(a) (b) 
Fig. 17.17 


Naturally, with this simple truss it is possible to solve without resort 
to matrix analysis but it should be realised that the force balance 
equations of much more complicated networks can be derived by using 
this simple technique. When the number of unknowns (and equations) 
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is large it is often an advantage to set up a matrix. The set of 12 equations 
may be represented by the matrix equation 


i -5 Ρι 0 
1 10 P, 0 
1 30 Py 0 

Ι 10 Αι 0 

1 Ry 0 

1 30 R, |_| 0 

1 io [Π 5.[{Ξ] ὁ 

Ι --ἰὸ 1} 5. 0 

1 30 [] Ss 0 

10 S10} lr 0 

10 —~10] 10, 100 

30 30 30] 1, —50 


In this form it is easy to reduce the set of equations by Gaussian 
elimination (see Section 8.6.1). The solution is found to be 


T; = —100/9, T, = 95/9,  T,=—10/9, Ss = 3000/9, 
S, = —100/9, S, = 1000/9, Rz; = —2850/9, R,=0 
R, = —950/9, Ps; = 300,9, P, = 100/9, P, = —50/9. 
The actual bar forces are 
10 535 ΞΞ Ὁ 
Ειὶ τ -- τ x V1025, Fy = 2 x V 1000, 
and F; = — 100/9 x V 1100. The negative signs of ἢ and F; show 


that the forces in bars (1) and (3) are compressive. The signs (as well 
as the magnitude) are very important because it is much more difficult 
to construct a bar to sustain a compressive force than to sustain a 
tension force. Hence in this structure particular attention must be given 
to bars (1) and (3) or alternatively the structure should be redesigned 
in order to ensure that all bar forces are of a tensile nature. 


17.3.3 Vibration network. Eigenvalues 
A. Mechanical spring system 


The equation of motion of the simple mass/spring system of Fig. 17.18 
using the notation of the Figure, in which ἐν is the unstretched length 


T m ἃ 
(b) 


~~ ly ne Ὁ... 
! m 


(a) 


Fig. 17.18 
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of the spring, is given by expressing Newton’s second law of motion 

(for constant mass) in the form Force = mass x acceleration. 

d*x 

dt? 

The minus sign indicates that the spring force T is a restoring force. 
For a simple spring, Hooke’s Law (an empirical law) states that the 

tension T is proportional to the extension from its unstretched length, 

ie. T = kx where k is a spring constant called the stiffness. (Note. This 

is sometimes expressed in the form T= Ax//, where A is another 


constant called the modulus of elasticity). Eliminating T leads to the 
differential equation 


1.6. T=-m 


x 
or sa tmx=0, W=kl/m 


The solution of this differential equation is 
x = Asin (nt + «) 


where A and « are two arbitrary constants to be determined using the 
initial conditions (i.e. the conditions under which the motion is 
started). The important point here, however, is that the vibration of the 
mass m is sinusoidal and the angular frequency of the oscillation (the 
pulsatance) is given by n. The period of the oscillation is 27/n and the 
actual frequency is n/2z7. 

Consider now a number of such spring networks linked together. 
The vibration in one mass will cause sympathetic vibrations in the 
other masses, i.e. there will be an interaction between the individual 
networks, For simplicity consider three such spring networks linked 
together as in Fig. 17.19(a) in which the three masses are m,, mz and mg 
and the connecting springs are of stiffnesses k,, k2, Κα and k4. The 
system is fixed at each end and the masses vibrate longitudinally. 


Alpe ~~ <I —- —— -ν"». 
(0) TY ΤΩ Te 13 Ts T4 
Fig. 17.19 
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The Fig. 17.19(b) shows the system in motion. The displacements »,, 
xX, and x3 of the masses are measured from the static equilibrium 
positions and to simplify the subsequent analysis it is assumed that in 
static equilibrium the springs are in their unstretched state. The 
equations of motion of the three masses then take the form 

—T, + Τ᾽, = μι d?x,/dt? 

-- Το + Ts = Mz d?x./dt? 

—T3 + T, = mg d?x,/dt? 
where T,, Tz, T3, and T, are the tensions in the springs. 
Hooke’s law expresses the tension forces as 

T, = ky, — Ο); Tz = kag — 1); Ts = Καίχᾳ — 2) 


T, = Κα — Xs) 
Eliminating 7,, Τῷ and T, leads to three differential equations given by 
m, X, = —(ky + ke)x1 + kore 
Mz X_ = koxy — (Κα + ks)x2 + Καχς 
Ms Χᾳ = kgxX2 — (ks + ks)Xs 
where x means d?x/dt?. 

Now look for that particular mode of oscillation in which all masses 
oscillate at the same frequency n/27 but with differing amplitudes. 
This particular. oscillation is called the normal mode. 

This assumes that 

X, = A, sin (nt + «) 

X2 = A, sin (mt + «) 

X3 = Ag sin (nt + «) 
and means that whatever configuration the system may have at some 
time f, it returns to the same configuration after a period of 27/n, 1.6. at 
time f) + 27/n. Inserting the assumed values for x,, x, and x, into the 
differential equations leads to the set of algebraical equations in A, 48 


and A; 
(ky + ke) ky 


SAS A de 
k k Κ 

—n?A, = — A, -- ΕΣ τὸ a 4 Ay + = As 
Me 
k. (k ae 

SEAS A Ae 


In matrix form this set of equations may be written 
CA =A 
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where C is the coefficient matrix, A is the amplitude matrix and 4 = n?, 
1.6. 


kth 0 
m, my, 
cul .- τὰ _ ke 
~ Me Me Ma 
. Ὁ Kat ke 
M3 Ms 


A; 
and A = | A, 
Az) . 


Note that A is the form of a vector and is often called the amplitude 
vector A. The equation CA = AA is of the form of (84) of Section 
9.11, i.e. A = n? is the eigenvalue of the matrix C and to each eigenvalue 
there will correspond a value of the frequency n. Thus for this system 
there will, in general, be three different modes of oscillation, i.e. three 
natural frequencies of oscillation of the system given by »//. Each mode 
of oscillation will give a different eigenvector which is the amplitude 
vector A. 

In order to examine such a system in detail consider the above system 
with 

m, = 6,mz = m3 = 4 


Κι, =k, τὸ Κᾳ =3,andk, = 1. 
The coefficient matrix C is then 
1 —1/2 0 
C={—-3/4 6/4 —3/4 
0 —3/4 1 
and the characteristic equation for 4 takes the determinantal form 


1-- πὶ 00 
—3/4 6|4-- —3/4 |=0 


0 3/4) 1--λ 
Boh, - eat 0 
or 24. G24. νὴ) a0 
0 -3. 4-- 4λ 


This equation reduces after the usual determinant evaluation and 
subsequent factorisation to 


(A— 144 — 944 — 1) =0 
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The eigenvalues are thus 
Ay = 1/4, Ap = 1, 4g = 9/4, 


and the associated non-normalised eigenvectors are as follows: 


for A, = 1/4, 
2 
AM = 3 
3 

for A, = 1, 

1 
A® = 0 
=| 

for A, = 9/4, 


2 
A® = | —5 
3). 


When the system is oscillating in its first normal mode 4 = 4, = 1/4, 
i.e. at its Jeast natural angular frequency n, = 4 the amplitude of the 
displacements are in the ratio [2:3:3] and so the three displacements 
X1, X2, and x3 may be written 


X, = 2a sin (4t + «) 

χὰ = 3a sin (4¢ + «) 

Xz = 3a sin (4¢ + «) 
where the values of a and « are to be determined by using the initial 
conditions. The period is 47 and is the greatest of the three normal 
modes. The motion is illustrated in Fig. 17.20(a) which shows the case 
for « = 0. 

When the system oscillates in its second mode with A = 1, 1.6. ata 

natural angular frequency nz = 1, the displacements are given 
x, = = dbsin(t+ A) 
x2 = 0 
x3 = —bsin(t + A). 

In this mode the centre mass is at rest and the other two execute 
simple harmonic motion at a period of 27. The values of ὁ and β are 
again determinable from the initial conditions. This mode is illustrated 
in Fig. 17.20(b) in which β is taken as zero. 

Finally, when the system oscillates in its mode of highest frequency, 
ie. A = 9/4, π = 3/2, the displacements are given by 

xX, = 2c sin (3¢/2 + y) 
X_ = —Se sin (37/2 + y) 
Xg = 3c sin (3t/2 + y). 
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The period of the oscillation is 47/3 and the constants c and y are 
again determined from the initial conditions. This motion is illustrated 
in Fig. 17.20(c) (for y = 0) and shows that the outer masses oscillate 
in phase but of differing amplitudes whilst the inner mass oscillates 
with a large amplitude and exactly 180° out of phase. 


Fig. 17.20 


These motions are, naturally, subject to any physical limitations of the 
springs and assume that the springs can be compressed as well as 
extended. 

If the system is set in motion in an arbitrary manner (e.g. by initial 
displacement and/or velocities of the masses) then the subsequent 
displacements x; will be a linear combination of the three normal mode 
oscillations. The three displacements at time ¢ will then be given by 


xX, = 2a sin (4¢ + «) + bsin (¢ + β) + 2c sin (3t + γ) 
X_ = 3asin (At + «) — 5c sin ($t + y) 
X3 = 3a sin (1 + «) — bsin(t + 8) + 3c sin 3t + γ) 


The six unknowns a, ὁ, c, «, β, y will be determined by six initial 
conditions (i.e. will be determined once the initial motion is prescribed). 
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For example if the system is started into motion by displacing the first 
mass a distance ἢ, keeping the other two masses in their equilibrium 
positions, and releasing it from rest, the initial conditions would be 


x, =A, Xo = 0 = Xz, Χι = χορ =X, = Oatt = 0. 


These six conditions applied to the above set of equations lead to the 
six equations 


ἢ = 2asina + bsin B + 2c sin y 
0 = 3asina — 5esin y 
0 = 3asina — bsin B + 3csiny 
=acos«a+ bcosf + 3ccosy 
0 = a cosa - δε cos y 
0 = $acos a — bcosB + 3c cos y 
from which it follows that 
a=h/8, b= 3h/5, c= 3h/40, «= β Ξε γ Ξε πί2. 
B Electrical networks. Filters 


The governing equations of a simple electrical circuit connecting an 
inductance L, and a capacitance C as in Fig. 17.21 are given by 


dQ dQ 
fap on T= aE 
L 
I Cc 
Fig. 17.21 


where Q is the charge on the condenser at time ¢ and J is the current 
flowing in the circuit. 
Eliminating J, gives 
O+nQ=0, n? = I/LC 
where Q means d?Q/dt?. 


The solution of this differential equation is Q = A sin (nt + «) and 
leads to 7 = nA cos (nt + «). 
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It may be observed that the L — C electrical circuit is the analogue 
of the mass-spring mechanical circuit. The differential equations are 
identical and it is possible to draw analogies between the components 
of the two circuits as in the following table. 


Electrical Mechanical 
Inductance, L Mass m 
(Capacitance)~+, 1/C Stiffness k 
ChargeQ- Displacement x 
Current J = ὁ Velocity v = x 
Applied emf £ Applied force F 
Resistance R Damping u 


In both the mechanical circuit and the electrical circuit discussed in 
this section the effect of damping (i.e. resistance or friction) is omitted. 
Also the driving forces of emf or applied force are also not considered. 
Both of these effects can however be taken into account. In a real 
engineering situation it is unlikely for example, that damping (resistive 
or frictional) will be absent. Nevertheless before these additional 
effects are included in the analysis, it is extremely useful to have con- 
sidered the properties of, and important results obtained by, linking 
together simple undamped, unforced circuits. 

The strong analogy between the basic mechanical mass/spring system 
and the inductance/capacitance electrical circuit is the basis of the 
analogue computer. 

The effect on the mechanical system of varying the mass and the 
spring stiffness can be visually represented by making an equivalent 
electrical circuit and varying the inductance and capacitance. The latter 
components can be varied very easily whereas it may be difficult to 
vary the equivalent components in the mechanical system. This analogy 
between mechanical and electrical circuits can be extended in scope to 
include complicated systems. For example the engineer confronted 
with an unwanted vibration (i.e. displacement and/or velocity) in a large 
and complicated mechanical system can frequently represent it by an 
equivalent complicated electrical circuit. Since the electrical circuit 
can be easily constructed in the laboratory the engineer will be able to 
assess what effect the varying of component values will have on the 
unwanted vibration and hence by analogy be able to assess the effect 
on the real mechanical system and hopefully be able to modify the 
mechanical system to remove the unwanted vibration. 

Returning now to the simple L — C circuit, suppose the circuit 
incorporated a switch which was held open with the condenser charged 
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to a value Qo. If the switch was closed at time ¢ = 0, the initial con- 
ditions would be 


O= 0, f= dQ/dt = 0, att = 0. 
Insertion of these conditions into the solutions for Q and / leads to 
Q. =A sina 
0 = nA cos «. 


and hence « = 7/2, A = Qo. 
The charge and current at time ¢ after the switch closure are then 


Q=Q,cosnt, [= —nQ, sin nt. 


The charge on the condenser and current flowing in the network both 
oscillate sinusoidally with a natural (angular) frequency n(=1/+/LC). 
In the absence of resistance there is no damping effect and the oscilla- 
tions are sustained for all time. 


Fig, 17.22 


Now suppose two such circuits are linked together as in Fig. 17.22. 
and let J, and J, be two cyclic currents, then the governing equations 
are 


d, Qn . dQ, 
hat ὅσον ἡ δὲ ae 
die ὃ, On 40. 
τ ὺς ο΄ Ὁ ee ar’? 


where Q, and Q, are the charges on the two condensers. 
Eliminating the currents ἢ and /, and rearranging the equations 
leads to 


LC 0, + 20, — Q2 =0 
LC Q.—-Q, + Q2.=0. 
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To find the natural frequencies in this case it is assumed that Q, and 
Q, are oscillatory with frequency n/2z, i.e. such that 


Q, = A, sin (nt + «) 
Q2 = Ag sin (nt + «) 


With this assumption the differential equations may be rewritten as a 
matrix equation 
2 πῃ 4 λ Ay 
-Ι 1 Ag} “λας 
where 2 = LCn’. 


It is noted once again that A is the eigenvalue of the coefficient matrix 


and that the natural frequency is given by +/(A/LC). The characteristic 
equation for 1 is 


2---λ -1 
-1 1-A 
1.6, λ5-- 3. Ἐ1 Ξ- Ο.β 


The two eigenvalues are thus 


[-ὁ 


3. ν5 3. κν5 
Mg ὩΣ OO Aste νὰ 
If three such circuits are linked together as in Fig. 17.23 the differential 
equations are 


LC 0, +20, — Ος =0 
LC Q2— Q, -- 20, -- Ος τεῦ 
LC Ο. —Q,. +03,= 
Lk L L 


Fig. 17.23 


and the natural frequencies of oscillation are given by n/2a where 
WLC = λ, and λ are the eigenvalues of the matrix 


2 --Ἱ 0 
--Ι 2 —!1 
0 --ἰι 1 
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The characteristic equation for A is λ — 5A? + 6A — | = 0 and the 
three values of 4 are found to be real roots A,, A,, and As such that 
0<4A, <1;51<4,<2;2 <A, <3. 

The numerical evaluation of these roots and hence the natural 
frequencies of oscillation will not be undertaken because it is better to 
investigate this particular circuitry by a different technique. Suffice to 
state that by an alternative approach it may be shown that the N 
eigenvalues 2,, A2,.. ., Ay for N linked circuits are given by, 4, = 
4 sin? (u/2), where w = (Qr — )π|(ΔΝ - 1), r=1,2,...N. 

The natural (angular) frequencies of the oscillations are then given by 

n, = (2/V LC) sin (4/2). 
For example, if N = 2, the two frequencies are 
my = (2/V LC) sin (π|10); ng = (2/V LC) sin (377/10) 
and if N = 3, the frequencies are 
ny = (2/VLC) sin (7/14); ng = (2/VLC) sin (37/14); 
Ng = (2/VLC) sin (57/14). 


The reader may check the validity of these values by inserting the 
appropriate numerical values of 2 into the above cubic equation. 
In the general case of N circuits 


ny = (2/VLC) sin (7/2(2N + 1)); 
ny = (2/VLC) sin (37/22N + 1)), .. 5 
ny = (2/VLC) sin {ΩΝ — 1)7/2(2N + 1}. 


It may be noted that as the number of linked circuits increases, i.e. 
as N — oo, the lowest and highest frequencies have limiting values, i.e. 


2 7 
Lim —= sin = 0 
noo VLC 2ΩΝ + 1) 

d tna =e ONT 3 eT ace 
ses VLC NOONE DY VEC 2 Vie 

The Fig. 17.24 shows the frequencies n,,r = 1,2. . . N for different 
values of N. 

This system with N simple linked circuits has its natural frequencies 
within a frequency range 0 to 2//LC. Thus if, for example, a combina- 
tion of currents at different frequencies are passed into such an elec- 
trical network it will readily accept the currents whose frequencies lie 
within this band but will attenuate those outside. Such a network can 
be used in electrical engineering as a low-pass filter circuit, 1.6, a circuit 
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that will pass the low frequencies in the range 0 to 2/VLC but reject 
those higher than this range. 

If the capacitance C and inductance L are interchanged in the simple 
circuits then it is easily shown that the frequencies within the band 0 to 
2/VLC are attenuated whilst those outside this band are readily 
accepted, and the network becomes a high-pass filter. 

By a judicious choice of components it is clearly possible to make the 
natural frequencies of a network lie within a specified band of values 
and so make an electrical device known as a band-pass filter which 
will readily accept only these frequencies within the specified band. 


ψιο 


0 1 2 3 4 5 Ν 
Fig. 17.24 


The low-, high-, and band-pass filters have many applications in 
electrical engineering, e.g. radar, television reception circuits, telephony 
and telegraphy noise suppression circuits, record player scratch filters. 

It has been noted however that electrical filters as described have 
differential equations equivalent to those of some mechanical spring 
networks and it may also be stated that additionally the equations 
governing acoustical apparatus also have similar differential equations. 
Hence by analogy it should be possible to design mechanical spring 
networks or acoustical devices that permit acceptance or rejection of 
certain frequencies of oscillations. For example, a mechanical machine, 
may have its unwanted oscillations ‘filtered out’? by mounting the 
machine in flexible spring supports and similarly the unwanted acoustic 
oscillations of a car exhaust may be minimised by using a series of 
linked resonating cavities mounted along the exhaust pipe, i.e. a car 
silencer. 
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17.4 VECTORS 


Vectors make their appearance in an engineering context in a number 
of different fields but particularly when the engineering system is in 
motion in 2 or 3 dimensions. The vectors of force, acceleration, 
velocity, and displacement are easily understood and the various 
mathematical ‘equations describing the desired physical motion are 
readily expressible in terms of these vector quantities. As an illustration, 
if a particle of mass m is travelling with a velocity v and is subjected to 
a number of forces Εἰ, F.. . . then the vector form of Newton’s second 
law stating that the total impressed force is proportional to the rate of 
change of momentum is expressible as 


F = SF εἰς (mv) 
dt sia 


This differential equation could be, for example, the fundamental 
equation of a rocket of varying mass subjected to various gravitational 
and thrust forces. 

Vectors are also useful in describing a physical system in statical 
equilibrium under a system of forces. The framework type problem of 
17.3.2 could, for example, have been described mathematically as a 
number of vector force balance equations. 

In this section the application of vectors to the field of crystallo- 
graphy is considered and it will be seen how the earlier work of sections 
9.7 and 9.9 is important in the description of crystal geometry. 

This field is studied by many engineers but particularly by the 
metallurgists and chemical engineers. It is found that the vectors of 
displacement and direction are of importance. 


17.4.1 Crystallography and three dimensional geometry 


The essential characteristic of a crystalline material is the periodic 
nature of its internal structure. The atomic arrangement is related to a 
network of points in space known as a space lattice. The points of the 
lattice may be occupied by single atoms, but usually in a crystal they 
are points at which there are identical groups of atoms. In a perfect 
crystal the outlook from any lattice point is the same as from any other. 
Starting from an arbitrary lattice point O in Fig. 17.25 it is possible to 
draw three straight lines each of which joins with three other lattice 
points, distance a, b, and c away, and form a set of coordinate axes 
(not necessarily at right angles to each other). 

The distances from the starting lattice point 0 to the other lattice 
points may be different, ic. a 4 6 - c. Instead of unit vectors along the 
three axes it is usual to form base vectors a, b, and ς so that any point 
P of space is specified by its position vector r and r = xa + yb + ze. 
The point P is then designated by (x, y, 2). The notation [x, y, 2], with 
square brackets, 1s used in crystallography to indicate the direction of the 
vector r. The three axes are called the crystal axes. Naturally, it is 
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possible to choose the axes in a number of different ways, for example 
for certain crystals it may be possible that two or even three of the axes 
are at right-angles to each other. 

When the axes have been chosen they form a unit cell, i.e. a parallele- 
piped of sides a, δ, and ¢ and each crystal consists of a repetitive 
stacking of unit cells. The volume of this fundamental building block 
is important and is simply the volume of a parallelepiped of sides a, 
b, and ς which is (see 9.9.1) given by Vand V=a.bxc. 


Fig. 17.25 


Points of the crystal which coincide with cell corners are given by 
integral values of x, y, and z but points of the cell faces or inside the 
cell will have fractional values of x, y, and z. 

Any particular direction in the lattice is specified by the direction 
ratios [x:y:z] or [Ax:Ay:Az] where / is chosen to avoid fractional 
coordinates. For example the line from the origin (0, 0, 0) to the point 
A of Fig. 17.25 at the middle of a cell face, 1.6. to the point (4, 4, 1) 
has direction ratio [}:4:1] or [1:1:2] or simply [1, 1, 2]. 

The faces of crystals, and also other cross-sections within the crystal 
lattice, are planes and in crystallography it is necessary to have a 
system of orientation of such planes. The crystallographer uses what 
are called Miller Indices and arrives at a trio of numbers as follows: 


(i) Find the intercepts, on the axes, of the plane in terms of the base 
lengths a, 6, and c. 

(ti) Write down the reciprocal of these numbers and form the smallest 
integers in the same ratio. 
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For example, if a plane has intercepts (4a, 3, 2c) on the three axes then 
the three reciprocals are (4, 4,4) and the Miller indices are (3, 4, 6). 
However, from Section 9.7 it is known that the orientation of a plane is 
given by its normal and hence the Miller indices must be related to the 
normal. In order to examine this relationship it is necessary to introduce 
a space lattice related to the crystal axes and known as the reciprocal 
lattice. The three base vectors forming the reciprocal lattice are desig- 
nated a*, b*, and c* and are given by 


a* = (Ὁ x c)/V; δὲ =(c x a)/V; c®¥ = (a x bD)/V 


where V = (ἃ. Ὁ x c) is the volume of the unit cell. The axes a*, b*, 
and c* are respectively perpendicular to the planes of b and c; ¢ and a, 
and a and b. Note that 


ΔΕ .a=b*.b=c*¥.c= 1 
and a* .b=a*.c=b*.a=...=c*.b=0. 


Fig. 17.26 shows the space lattice a, b, and ς together with the 
reciprocal lattice vector a*. 


* 


Fig. 17.26 


Using the property of the scalar product, with 6 the angle between 
a and a* 


a®*.a=a*acos@#=1 
Le. a* = 1/acos‘0. 


This result shows that the length of the base vector a* is the reciprocal 
of the projection of a onto a*. 

Consider now any plane that has intercepts on the crystal axes Oxyz 
of lengths aa, Bb, and yc as in Fig. 17.27. The Miller indices are the 


smallest integers in the ratio [3, 2, 21]. 
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Fig. 17.27 


Then in vector terms 
OA =aa, OB=fb, OC=yc 


The normal to the plane is a vector n perpendicular to AB and CB, i.e. 
to (Bb — aa) and (γε — fb). Thus (see Section 9.6.2). 


n = (6b — aa) x (ye — fb) 
= By(b X ὃ — ya(a X c) + «f(a x Dd) 
since bx b=0. 
In terms of the reciprocal vectors 
n= Bya*V + ayb*V + afctV 
1 


1 1 | 
= — 2 Ὲ — ΒῈ — o* 
= ofyV (=a 3" 19} | 


It follows immediately that 
1 1 1 
n* = —a* + = b* + - οἴ 
α β γ 


is a normal to the plane. 
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But (1/«, 1/8, 1/y) are proportional to the Miller indices which have 
immediate reference to the normal when the latter is expressed in terms 
of the reciprocal vectors. 

Another important concept of interest to the crystallographer is to 
know the condition that a certain direction specified by [p:q:r] 
is parallel to a given plane of Miller indices (u, v, w). 

Using the previous results the direction [p:g:r] is specified by the 
vector 

r= pa-+qb+re 


whilst the plane has its normal in the direction of r* where 
r* = μα" + vb* + wer 


The condition that [p:q:r] is parallel to the plane is simply that r is 
perpendicular to the normal r*, i.e. tlat 


r.r*¥=0 
i.e. (pa + qb + re). (ua* + vb* + we*) = 0 
1.6. putqt+rw=0 


17.5 COMPLEX NUMBER 


The representation of a complex number z as a vector in the Argand 
diagram and the Euler form z= re” =rcos@ + irsin@ give z a 
geometrical interpretation and also show how a complex number may 
be related to the trigonometrical functions. These two representations 
are essential to the understanding of the way in which complex number 
analysis can be used in an engineering context. The simplest major 
application is in the field of alternating current theory which is concerned 
with the response of a network to applied alternating currents or 
voltages. The elements of this theory are considered in this section. 


17.5.1 Alternating current networks 


The basic alternating current which excites an electrical network is of 
the general form 


i= 10 cos (wt + «) 


in which 10 is the amplitude, ὦ is the angular frequency and « is the 
phase angle. Since this current may be expressed in the form 


i = (hy cos «) cos wt — (Jp sin «) sin wt 


it is sufficient to consider the response of the network to the basic 
current 
i=Acoswt or i= Bsinwt 


Now since (see Section 10.4) 


e7°? = cos wt + 7 sin wt 
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where j? = —1, it follows that these basic currents may be obtained 
by taking either the real or the imaginary part of the complex current J 
in the form 


I = 2 eft 


Note The use of j? = —1 instead of i? = —1 as in Chapter 10 follows 
Electrical Engineering practice where i is invariably used to represent 
a current. 

For example if 4, = 3 and ὦ = 2 then the real part of a complex 
current 3e” is 3 cos 2t and the imaginary part is 3 sin 2¢. 

Further, a more general type current with a phase angle can be 
simply represented as the real part of a complex current. For example, 
the current 


i = 2 cos (4t — 7/3) 
is simply the real part of the complex current J where 


T = 2el(4t- 7/3) 
Le. 1 = (2 7π|8)ς74: 


which is of the form 
T => h eat 


with J, = 2e-'7'3 and ὦ = 4. 

Since the voltage response to a sinusoidal current is also sinusoidal it 
is to be expected that a complex current excitation of an electric 
network will lead to a complex voltage response (and vice versa). 

Consider now the voltage response of a simple network to a current 
of the basic form 


i= 19 cos wt, I, real 
Suppose the current i (amps) flows in the network of Fig. 17.28 which 


consists of a resistance R (ohms), inductance L (henries), and capaci- 
tance C (farads). 


VR vi 
Ve 


Fig. 17.28 
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The voltage response of the elements of this network are vg, v,, and 
v, respectively and, if q is the charge of the capacitance, 


Vg = Ri = RI cos wt 


di . 
v,p=L i —oLh, sin wt: 


1 fT, In. 
vem 46 τ τς fide = sin ot 


If the current i is expressed as the real part of a complex current J, i.e. 
i= Rel = Rel, e** 


then in like manner the voltages vz, vz, and vg are the real parts of 
complex voltages Vp, Κι, and Κο obtained as follows: 
(i) ve = RI cos wt = Re Rly ef°' = ReVp 


i.e. Vr = RI 
(ii) vy = —oL 10 sin wt = Re joLl I, 6791 
= Re V, 
i.e. V,=joLl 
τα hh. j 
(iii) vo = ἘΣ sin wt = Re (- ee 5 et) 
— Re Vo 
chp renee 
a Ve ως = jac? 


In each of the cases (i), (ii), and (iii), the response of the element to a 
complex current / is a complex voltage V of the form 


V=ZI 


where Z is a complex quantity called the impedance of the element of the 
circuit. 


For a resistor R, the complex impedance Ζ is R, and is real, 

for an inductance L, the complex impedance Ζ, is jwL and is 

imaginary, 

for a capacitance C, the complex impedance Z, is I/jwC and is 

imaginary. 

The significance of the representation of the current and voltage 
in complex form may be appreciated after reference to Section 10.1 
where the relationship between vectors and complex numbers was 
investigated and it was noted that a multiplication by j (or i of Chapter 
10) had the effect of a rotation through a right-angle in the positive 
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sense. Let the impedances Zp, Z,, and Z, be represented on the Argand 
diagram as in Fig. 17.29. 


imaginary axis 


Ζι = fwl 


ZR=R real axis 


Zea -ἰ ως 
Fig. 17.29 


The impedance Z, is represented by a vector of length R along the real 
axis whilst the impedance Z, and Z, are represented by magnitudes wL 
and I/wC respectively along the positive and negative imaginary axes. 

The voltages may also be represented on an Argand diagram, and 
Fig. 17.30 shows the complex current J = J, e?* together with the 
voltages Vp, Vz, and Ve 


imaginary 


νι Ξήωι!ϊ 


Ve = - τὼς 


Fig. 17.30 


The Fig. 17.30 gives a geometrical (complex) interpretation of the 
fact that Κ is in phase with the current J, that V; leads the voltage Vz 
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by a right angle (i.e. there is one right angle difference in the appropriate 
sense between the phase angle of V,; and V,), and that Κο Jags the 
voltage Vz by one right-angle. The voltages V, and V, are exactly 7 
{or 180°) out of phase. 

As time increases the angle wr increases and the voltage vectors of 
Fig. 17.30 may be considered as rotating around 0 with angular velocity 
ὦ the relative phase angles being unchanged. The total voltage V 
across the network is the vector sum of Κα, Κι, and V¢ and this is 
represented on Fig. 17.30 where the angle ¢ is the angle between V and 
1. In this simple network, if wZ > 1/wC then the voltage V is in the 
direction indicated in Fig. 17.30 and, because this resultant complex 
voltage is on the same side as the inductive voltage V,, the network as a 
whole is said to show an overall impedance which is inductive and the 
voltage V then /eads I by a plane angle ¢. Should wL < 1/wC then the 
voltage V would be on the same side as the capacitative voltage V, and 
the network would then exhibit an overall impedance that was capaci- 
tative and the voltage V would /ag the current. 

The simple circuit of Fig. 17.28 may be replaced by an equivalent 
circuit as in Fig. 17.31 in which the quantities 1, V, and Z are the 


ν 
Fig. 17.31 


complex current, voltage, and impedance respectively. The complex 
impedance of the equivalent circuit is 


Z=Z,+Z,+ZLe 
, 1 
ως 
3 Ι 
and y=iZ=1(R +) (ox-)| 


Example (i) The actual current 7 in the network of Fig. 17.28 is 
i = 3 cos 4t, find the overall voltage v and also ug. 
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Solution In complex form 
T= 3e74t 


1 
— fat ; —_— 
and V = 3e [κ +j (oz =z) 


Hence v = ReV = Re 3(cos 4t + 7) sin 4t) [κ +j (oz - =a)} 


and v = 3Rcos 4t — 3 (or -- =z) sin 4t 
ως 


The complex voltage across C is Κ͵ο and 
and hence Vo = ReVeo 


Example (ii) The actual current i in the network of Fig. 17.28 is 
i = 2 sin 3¢. Find the voltage v in the caseeR=2,L=4,C=1. 


Solution The current is 
i= 2 sin 3t = 2 cos (3: — π|2) 
the complex current is 


T = 2ef8t-7/2) 


the complex voltage V then has the form 
V = 2673: m2) [κ +j (3 ~ sc)| 


3C 
= 2(-- ε΄ + j(7/6)}. 
The actual voltage v is then 


v = ΚΟΥ 
= 4sin 31 + (7/3) cos 31 


Note It is assumed in the above process that the circuit is excited by a 
current i which, although time-dependent, is in its steady state. That is, 
any transient effects due to switch-on phenomenon have died away. In 
a mathematical sense this means that the circuit switch was closed at 
time 1 = — o., 

Consider now a more complicated electrical network as in Fig. 
17.32 which shows the actual circuit (a) and the equivalent complex 
form (b). When a sinusoidal steady-state voltage of angular frequency w 
is applied across the terminals AB, each of the elements of the circuit 
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(a) (b) (c) 
Fig. 17.32 


has an impedance dependent on that frequency. The complex voltage 
V(= Vo e’*") is applied across AB and the complex impedances are 
given by 
er es 
Zy — oC,’ Ζ, — wC,’ 
The impedances may be combined together by the usual series and 
parallel rules the network then being equivalent to a single impedance 
Z, as in Fig. 17.32(c), such that 


Z,=R, Z,=jol. 


I 1 ἢ 

a= oC; E + Fok A 

Pepe | E + j(@L -- oe] 

a Rj(@L — 1/wC2) 
= JR(@L — 1/@C,) 
oC, R+j(@L — 1/wC,) 
The complex current J is then given by 

I= V/Z 


and the actual current i flowing through the network would then be 
obtained by taking the real part of V/Z. 

When dealing with impedances in parallel it is sometimes more 
expedient to work in terms of the reciprocal 1/Z. This quantity is called 
the admittance Y and it follows that two impedances Z, +.Z, in 
parallel is equivalent to two admittances Y, and Y, in series. 


1.6. Z= 


Resonance of a network 


Consider the simple R, L, C, series circuit of Fig. 17.28 excited by an 
applied sinusoidal voltage of angular frequency w. Suppose the complex 
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voltage is V then the complex current J flowing in the network is given 
by 
I= V/Z 
where Z = R + j(@L — 1/wC) 
The magnitude of the current, [1 |, for given values of R, L, and C is 
dependent on w and reaches a maximum value when |Z [; 1.6. 


VR? + (oL — 1facy 


is a minimum. This minimum value of [Ζ], for varying ὦ, is given when 
ω = ὡρ such that 
1 
WoC 
ie. ὡς = 1/VLC. 


The impedance Z is then purely resistive with the value R. 

The graph of |J| plotted against w has the shape of Fig. 17.33 and is 
known as a resonance curve, the frequency ὡρ being called the resonance 
frequency. 


OL = 


III 


O Wo ω 
Fig. 17.33 


Note that at this resonant frequency the impedance of the circuit is 
purely resistive. This fact provides a simple criterion for resonance of 
any circuit, namely that at resonance the imaginary part of the complex 
impedance Z of the circuit is zero. For circuits with elements in parallel 
it may be more convenient to work in terms of admittance. Now 
suppose 

Y=1/Z=A+jB 


A B 


then 4-74 w/e aR 
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and the condition for resonance is 

ImZ =0 
1.6. Β-0 


which is the same as ΠηΗΥ͂ = 0. Hence for resonance the imaginary 
part of either the circuit impedance or admittance is zero. 


Example Find the resonance frequency for the circuit of Fig. 17.34, 


Fig. 17.34 


Solution The admittance Y is given by 


1 
Y=1/Z= R+joL + joc 
= eee tee 
The resonant frequency wa is given by 
ImY(@) = 0 
i.e. WoL ](R? + @o2L?) = WoC 
i.e. Wy = VI/LC — (R/L)*. 


Note that this frequency differs from that of the simple R, L, C series 
circuit but that it approximates to 1/./LC if R -- 0. 

Resonant circuits play an important role in the electric circuitry of 
radio, television, measuring and filtering equipment. Consider for 
example the network of Fig. 17.22 of Section 17.3.3. The impedance of 
such a network when an input voltage of frequency ὦ is applied, is 
given by 

Z=joL+Z, 
1 1 1 


ak Z,— > ΠΤ * job —jJot 


ΒΝ ΡΟΝ 
αὐ ον ΠῚ ΕΣ Gs 


APPLICATIONS 509 
and hence Z becomes 
Z = joL — [ως — 1f(@L — Π[ω}} 1 
For resonance it is required that JmZ = 0, i.e. that 
᾿" 1 
“ὡς --Ί (ΩΣ, — 1/@C) 


and at the frequency given by this equation the circuit presents a (zero) 
resistive impedance. 
If w?LC = A, then the resonance condition is 


ΟἹ, 


Ι 
1S 1) 
ie. λ5 —3A+1=0. 


This gives the two natural frequency of resonance as already obtained 
in the earlier work on filter theory of Section 17.3.3. (see page 493). 


The A.C. Bridge 


Consider a network of impedances as in Fig. 17.35 with complex 
cyclic currents 1, 15, and [, due to a complex voltage V all at frequency 
w. The network equations are given by 


LZ, + (, _ 1.)Ζε + 0 = 1.)Ζς ΞΕ 0 
(Ip — 1)Z; + {2 + (I, ae 1)Ζς ΞΞ 0 
(UIs — 1)2Ζ, + Us ττ T,)Z3 = Κ. 


Ge 
ν 
Fig. 17.35 


This type of A.C. bridge is adjusted by varying the impedances so 
that the current flow in the indicator Ζς (e.g. ear-phones) is zero, This 
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requirement is ἢ — J, = 0, with ἢ, 15, and J, non-zero. With zero 
current in Z; the first pair of the above three equations reduce to 


L(Z, + Zs) — 13Z, = 0 
I(Z_2 + Z3) — 13Z3 = 0 


from which it follows that J,, 15 are non-zero providing 


Ζι + Zs Z4 = 0 
ZZ FZ. 

1.6. Z,Z3 = Ζ.Ζ, 

or Z,/Z, = Zo/Z3. 


Thus the requirement that there is no current flow in Zs, i.e. that the 
bridge is balanced, is that the impedance in the four arms of the bridge 
are in the above ratios at the frequency ὦ. Since the impedances are, 
generally complex quantities then by taking real and imaginary parts 
of the condition, two relationships may be obtained. 

The types of A.C. bridges and the uses to which they may be put 
are many and varied. As an example consider the particular bridge of 
Fig. 17.36 known as the Wien Bridge in which the resistance R, and 
capacitance C, are unknown and k is a real positive number. 

Relating Figs. 17.35 and 17.36 then 


Z=R+ 1/joC,, 
Z,= Ro, Zz; =kRo, 


1 


ΖΞ TR + joc 


and the bridge is balanced when 


1 Ι 
(ἃ ταὶ = κι (Tez Gos) 
' j Sie τὰ hee 
i.e. k [{μ, - +) (1/R + jo0)| =1=1+/0. 


Equating real and imaginary parts gives two equations 
k(R,/R + C/C,) = 1 
wCR, = 1/wC,R 
and, solving for C, and Καὶ gives 
C, = kCU + 1/?C?R?) 
Κι = R/k(L + w?C?R?). 
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The values of C, and R, can therefore be found at frequency w. Alterna- 
tively if Cy and R; are known it is possible to use this device to measure 
the frequency ὦ of an oscillatory voltage. 

Suppose that Οἱ = Cand R; = R (this is easily achieved by a ganging 
arrangement) then when the bridge is balanced the above equations 
lead to 

k=} and w=1/CR 


Thus if k is given the value of 1/2 and the resistances R,, R and capaci- 
tances C,, C are coupled together so that they have identical known 
values the frequency of the oscillation is obtained by calculating 1/RC 
when the bridge is balanced. 


Cy 


Vv 
Fig. 17.36 


17.6 DIFFERENTIAL EQUATIONS 


Engineering is frequently concerned with the rates of change of certain 
variables and the physical problem is often expressed simply as an 
equation of balance, e.g. a balance of mass, or heat, or energy. The 
balance equation itself may in the physical sense be a simple statement 
such as 

Input — Output = Accumulation 


over a specified time interval, or even an empirical law such as 
Force = Mass x Acceleration. 


When the statement is expressed using mathematical symbols it often 
represents a differential equation (derivatives being involved because of 
the involvement with rates of change). In this way a simple physical 
statement may lead to a (sometimes complicated) differential equation. 
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If this equation can be solved, either analytically or numerically, then 
the engineer must interpret the mathematical solution in.terms of the 
physical variables. Consider now two engineering type problems which 
lead to differential equations. 


17.6.1 The oil purifier 


The oil used in lubricating a ship’s engine becomes dirty after a 
certain time of running. It is not always possible, or necessary, to 
change the oil and a purifier may be installed within the ship. This is 
usually of the centrifuge type in which the dirt particles and other 
impurities are removed by centrifugal action. 

The mechanical process consists of passing the impure oil from the 
engine into a drain or settling tank where some of the larger impurities 
can be removed by gravity. The oil from the tank is then fed to the 
centrifuge where some of the impurities are removed and the part- 
purified oil is returned to the drain tank. The oil is returned from the 
tank via an output pipe to the engine. The process is illustrated in 
Fig. 17.37(a). 


part —purified from engine 


to engine 


impurities DRAIN TANK 


centrifuge 
Fig. 17.37(a) 


km8dt 


DRAIN TANK 


cmdt 
ς 


Fig. 17.370) 
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The physical situation is analysed mathematically by forming a 
balance equation on the mass of impurities in the system. The Fig. 
17.37(b) shows the input and output of impurities in the drain tank. Let 
p be the net mass of impurities entering the tank per second; let c be the 
concentration of mass impurities in the tank at time £; let k be the 
concentration of mass impurities returned to the tank from the centri- 
fuge; let m be the mass of oil entering the centrifuge per second; let 
M be the total mass of oil. The physical mass balance equation for the 
impurities in the drain tank (see Fig. 17.37(b)) is 


mass input — mass output = mass accumulation. 


If the concentration of impurities in the tank changes from c to c + ὃς 
in the time interval ¢ to ὁ + δὲ then the mathematical statement of the 
balance equation is 


{pot + k(mér)} — {cmdt} = {M(c + 6c) — Mc}. 
Dividing by δὲ and proceeding to the limit as δὲ > 0, 


Me pee 
Roe m—cm 


: ἄς m Pptkm 
1.6. TE + M c= ay ΣΝ 

which is a differential equation of the form of equation (15) of Section 
11.1.5, ie. dy/dx + ἂν τ Q(x) with y=c; x=t; a=m/M, 
Q(x) =(p +km)/M. The solution is given by equation (18) of 11.1.5, 
1.6. 


ἐξ Α εσπὴμ 4 (py + km)/m 


where A is the arbitrary-constant-of integration. Assuming that the 
initial concentration at t = 0 was c = cp then 


C= A+(pt+km)/m 
and hence c= coe ™™ +4 (k + plm\(l — e ™/), 


The graph of c (the concentration of impurities in the oil) as a function 
of ¢ is illustrated in Fig. 17.38. 

The ultimate (i.e. t-> 90) concentration is given by (kK + p/m) and 
even with no input from the engine (p = 0) it is impossible to reduce the 
concentration below the value k. Any further reduction in concentration 
can only be achieved by reducing k, i.e. designing a more efficient puri- 
fier. Writing 


(¢ — k — Ρ[ 1) = (co — Καὶ — p/m) Θ᾿ πὴ Μ 
the concentration will fall half-way towards its ultimate value when 


—~mtiM — 1 
[Ὁ ΞΞ: 5. 
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Co 


Fig. 17.38 


1.6. the half-way concentration is reached after a time t = (M/m) log 2. 
This half-life time can be reduced only by increasing m (for a given 
total mass M) and this again modifies the design of the purifier. This 
design is optimised by taking all possible factors of the centrifuge into 
account, e.g. the speed, the maximisation of the input, the maximisation 
of the purification and the physical size limitation. 


17.6.2 Chemical reactions 


Reactions between chemical materials to produce a yield is a process 
which is of great importance in the chemical industry. The types of 
reactions are many and varied. Consider first only the simplest kind of 
reaction in which a chemical material A transforms into another 
material B possibly by reacting with a catalyst (which itself undergoes 
no change). There is assumed to be no change in volume of the system. 

If the concentrations of materials A and B at time ¢ are c, and cz 
respectively then in a time interval δέ the mass balance equation (i.e. 
mass flow in — mass flow out = mass accumulation) is given by (see 
Fig. 17.39) 

0 — Χο, δὲ = (c4 + dc,4) — c4, for material A 


and ke,6t — 0 = (cg + δαρ) — 65. for material B. 


where Κὶ 15 a constant called the reaction constant or rate coefficient. 
Dividing by δὲ and proceeding to the limit leads to a pair of differential 
equations, 


dc, 

“dt = ke, 
εἰ 

sled ΣῈ +ke, 
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If the initial concentration of A is a then the first equation is easily 
integrated to give c, = ae~**, If further the initial concentration of B 
is zero the second differential equation leads to cz, = a(l — e-**), 


t, Ca t, CB 
k 
A 
t+ 8t, qt 8c, t + δὲ, cgt 8c, 
Fig. 17.39 


Note that this simple type of reaction proceeds until eventually 
(t > 00), c, reduces to zero and c, is a, ie. all of A is converted into B. 

A more complicated situation arises in which B may react and 
convert back into A but at a different rate. This situation is illustrated 
in Fig. 17.40 in which the forward rate k, is different to the backward 
rate Kp. 


Fig. 17.40 


The equations of reaction are now found to be 


dc 
ae = keep — kyc4 
εἰ 

and - = ke, -- kets. 


These differential equations can be solved by noting that 
de,/dt + dc;/dt = 0. 
Thus, C4 + Cz = constant 

=a+b 


where a and ὁ are the initial concentrations of c, and Cz respectively. 
Substituting into the first differential equation leads to 


dc 


a = k(a+b—ca)—kyc, 


d 
ie. — + (ky + κῶς, = ka + δ) 
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which has the solution (see Section 11.1.5) 
C4 = Ee tat + koa + by(ky + ke) 


where E is the arbitrary constant. The initial condition on cy, i.e. 
C4 = a,t = 0, gives 


E=a—k,a + b)(ky + kz) 


and hence the complete solution is 


_ (a+) . et ΉΤΟ 
ὑπ πα eek 


Cp =~atb—C,, 


where K = k,/ko. 

The limiting (1.6. t— co) concentrations of c, and cg, are then 
(a + 6/1 + K) and K(a + 5)/(1 + K) respectively. 

Note that in the particular case b = 0 and k, > ἄς (ie. kz > 0, 
K-» o) the concentrations become 


Cas ae kt 
and Cp = al — e7*4) 


which are the results already found for the simple forward reaction. 

In the simple forward reaction the process progresses by converting 
a molecule of substance A into B and the reaction is called one of the 
first order. A more general reaction would be one such that two 
reacting substances A and B (say) are brought together and one mole- 
cule of A reacts with one molecule of B to form some resultant product 
(or products), i.e. 

A + B— Products. 


Generally the reaction rate is dependent on the concentrations of both 
A and B and a generalised form of the governing differential equation 
for the rate of change of A is 


oe = —kic4%Cp* 

where «, and β are the reaction orders with respect to A and B respec- 
tively and « + β =a 15 the overall reaction rate, and k,, is the rate 
constant. In a general reaction « and f need not be integers, but 
usually ἡ < 3. The reaction orders (i.e. the values of « and #) are 
usually deduced from experimental observations. A particularly useful 
law followed by a number of reactions is that for which « = β = 1, ie. 
n = 2 and the reaction is called second order. The governing equations 
for this reaction are 
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This differential equation may be integrated on noting that 
d 
at (C4 — Cp) = 0. 


i.e. C4 — Cz = constant = a — ὃ = J (say) 


where a and ὃ are the initial concentrations of A and B respectively. 
The differential equation for c, then has the form 


de. 
dt 


which is of the type of Section 11.1.2, 1.6. variables separable. Hence 


dc, 
i) Ce -| att 
1 


1 
SE lS = =| dc4 = —Ak,t + constant 


= —kyc4(e, — A) 


which leads to 


where D is an arbitrary constant of integration. 
Inserting the initial condition that c, = a, t = 0 leads to the evalua- 
tion of D and hence to 


a-—b 


cA 1 — (b/a) δ σ-καία - byt" 


More complicated reactions may take place and possibly unwanted 
effects may occur, for instance although it may be required to convert 
material A into material B it might happen that a further material C 
is produced by a forward reaction from B as in Fig. 17.41(a) or by a 
side reaction from A as in Fig. 17.41(b). 


Fig. 17.41 


Consider as a final example the case of a reversible reaction A to 
B taking place and an unwanted irreversible side reaction to Y also 
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taking place as in Fig. 17.42 where k, and k, are the reaction constants 
A to Band B to A respectively and k, is the reaction constant from A to 
Xx. 


Fig. 17.42 


The governing equations are 


dc 

ons = —kye, + kgcy — koe 
dc 

a = kyc4 = k3cp 

dc 

ἘΣ ΞΞ kel. 


The first pair of equations do not contain cx and hence can be solved 
for c, and cg. Write D = d/dx, then 
(D + ky + keen πο Κ368 = 0 
—kic, + (D + Κ3)68 ΞΞ 0. 
Operating on the first equation with (D + k3) and multiplying the 
second by kg and adding the resulting equations leads to 
{(D + Κρ) + ky + ke) — kiks}e, = 0 
or (D? + KD + Lic, = 90 
where K = ki + kg + kg and L = Καί. 
This second order differential equation has the solution (see Section 


11.4) 
C= Ρε οι + Q ο΄ δὲ 


where P, Ο are arbitrary constants and «, f are given by 


K+vVK?—4L 
= Oe 
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and are both real and positive. The concentration cy may now be 
found from the first of the governing equations, i.e. 


d 
Κβ6ρ ΞΞ ac + kyce4 + Κις... 


= P(k, + ke ἔσο a)e7 δ! Ἔ Oh, + ke = Bye 4 
and finally cx from the last of the governing equations 


Cx =R a MaP os __ Ha os 
α β 


where R is a further integrational constant. The 3 arbitrary constants 
P, Q, and R may be determined from the initial conditions of con- 
centrations, i.e. if a is the initial concentration of c, and the initial 
concentrations cz and cy are both zero then 


a=P+Q 
0 = P(ky + kg ~ αὐ + Ok, + ke — β) 
0- καὶ -- χφρία — k,Q/B. 


giving three equations for the three constants P, Q, and R. 


17.7 PARTIAL DIFFERENTATION 


Many physical quantities are functions of several (i.e. 2 or more) 
independent variables, e.g. the volume V of a circular cylinder is a 
function of its height ἡ and radius of cross section r and V = mrh; the 
pressure p of a perfect gas is a function of temperature Τ' and volume p 
and p = RT/v where R is a constant. 

Engineering situations are frequently concerned with the change of 
one physical quantity with respect to variations in the other variables. 
These rates of change lead to partial derivatives, e.g. if Pp = RT/v then 
the rate of change of pressure at constant volume Ὁ is ép/0T while the 
rate of change of pressure at constant temperature T is ép(év. Likewise 
it is possible to have other derivatives such as 07; /ép and dv/OT. Con- 
sider now two fields of engineering study that demand the use of partial 
differentiation. 


17.7.1 Elasticity. The displacement 


Suppose a straight bar of elastic material is lying along the x-axis from 
x = Oto x = L. Let the end x = 0 be fixed and the other end be subject 
to a longitudinal force that causes the bar to extend. The Fig. 17.43(a) 
shows the static equilibrium state and Fig. 17.43(b) shows the situation 
after stretching. Because this motion takes place in one-dimension, i.e. 
that of the x coordinate, it is to be expected that any rates of change 
in this case will not lead to a partial derivative. 


520 MATHEMATICS FOR ENGINEERS AND APPLIED SCIENTISTS 


xrtut Sat du 


Fig. 17.43 


The point P moves to a new position P’ and is said to have a displace- 
ment of u. The point Q (originally at x + 6x) takes up its position at Q’ 
(ie. the point x + u + dx + du), and the element PQ (originally 6x) 
has changed to P’Q’ (now of length dx + δι). Different points P along 
the bar will have different displacements, i.e. u is a function of x, 


u = f(x). 
Then 
du = f(x + dx) — f(x) 
d 1 d? 1 d° 
= Lox 4 thon + ow +. —_ 


after expanding f(x + 6x) by Taylor’s theorem. 

In the simple theory of elasticity it is assumed that the change in u, 
i.e. du, is small enough to be represented by the first term in the series 
1.6. 


du 
du = he Ox 


This function du/dx is called the rate of displacement or the displacement 
gradient and has an important role in Elasticity theory. Interest is 
centred on the change in length of the small element PQ and as a 
measure of this change it is usual to compare the change in length 
(Ρ΄ ο’ — PQ) with the original length PQ and to examine this ratio 
as the length element PQ tends to zero. This quantity is called the 
strain at the point P and takes the symbol ε, i.e. 


. (dufdx)dx +... 
m -.-.ς.--Ἐ-.-.-.  -.-.--.-ὕ---- 


ὃχ- Ὁ ὃχ 
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The ideas inherent in this measurement of strain can be extended to 
two-dimensional elasticity. Suppose a sheet of elastic material occupies 
a part of the usual two-dimensional space of x and y and is distorted by 
a tension. The elementary length dx in the x-direction undergoes a 
change but so also does an elementary length dy in the y-direction. 
The Fig. 17.44 shows a small rectangular element PORS of the elastic 
medium such that P is the point (x, Jy) and the sides of the element are 
ox and dy. After a tension is applied to the medium the point P moves 
to P’ (x + u, y + v) and the other points take up new positions as 
indicated in Fig. 17.44. 


Fig. 17.44 


The distortion consists of 3 distinct parts, firstly a translation as a 
rectangle so that P takes up the position P’, secondly an elongation 
of the rectangle to the shape P’Q”R”’S” and thirdly a rotation of the 
Sides to take up new position P’Q’R’S. The first part, 1.6. the trans- 
lational movement may be ignored because the relative position of 
points are unchanged. The third part of the distortion is represented by 
the angles « and β of Fig. 17.44. 

The quantities uv, v are the components of the displacement PP’ and 
are functions of position, ie. u = u(x, y) and v = v(x, y). 

The coordinates of the points relative to P are as follows (assuming 
α and β are small): 


Q: (6x, 0) 

S: (0, dy) 

P’: (u, v), i.e. (u(x, y), v(x, y)) 

QO’: (6x + u(x + dx, y), v(x + dx, y)) 
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ο΄: (6x + u(x + dx, y), υ(χ, y)) 
S's (u(x, y + dy), ὃν + o(x, y + ὃν} 
S”: (u(x, y), dy + v(x, y + Oy). 


The component of strain in the x-direction is designated ¢,, and is the 
unit change in length of the elementary length Ox, i.e. 


ee ge eee) 
porn PO 
Bee eS" 
οο ἢ᾿ἘῸ 
Now 
P’Q” = 6x + u(x + 6x, y) — μία, y) 
ou 0?u (6x)? 
= 6x + Ox 6x ra ey 


using the Taylor expansion for a function u with one variable y held 
fixed. 
Thus 
, (μι δχ)δχ 
ει πΞΙσπ- Paks 
ἼΔ χοῸ ὃχ 
= δι [ὃχ. 
In a similar manner the strain component in the y-direction is 
designated €2 and is given by 


gate 
22 ps0 PS 
P'S” — PS 
= Lim —>5— 
ἐπὶ PS 
δυ[γὴδ 
= Lim (@aleyy +... 
dy—0 dy 
= dv/dy. 


The extent of the shearing action which has distorted P’Q”R’S” to 
P’Q’R'S' may be examined by forming the difference between the 
right-angle QPS and the new angle Q'P’S’, i.e. in Fig. 17.44 by the 
angle (« + £). 


, " 


Now a = tan α == Lim “pros 
but Q'O” = v(x + dx, y) — of, y) 
ov 6?v (6x)? 
= FO + 5 2! 


and P’Q” = dx + du/ax dx. 
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Hence, assuming that Gu/éx <1, then 


av/ax)d 
ieee (év/0x)dx 4 


ox 
= dv/éx. 
Similarly 
B = ϑιίδν 
and the shearing effect may be measured by 
_ Ww. ou 
eS Ox sp ὃν 


This quantity is designated y,. and is usually called the engineering 
Shear strain. 


Note (1) The suffixes 1, 2 refer to the x- and y-directions respectively. 
Some texts designate the strain components as é,,, e,,, and 


Yay 


Note (2) It may be observed that γι can be defined by interchanging 
the roles of x, y and of w, vin yy, 1.6. yo) = δμ ὃν + dv/ax and 
it follows that γ1: = yp). 


These ideas may be extended to a three-dimensional elastic solid in 
which a point (x, y, z) is displaced to a new point (x + u,y + v,z + w). 
The strain components are then given by 


€11 = Ou/ox 712. = Culdy + dv/ox = yay 
Ego = Ov/dy Yi3 = Oulez + ὃν) δχ = ya, 
&33 = Ow/0z Yas = Ovfoz + dwldy = yoo. 


From a mathematical viewpoint it is more convenient to use the alter- 
native notation (x,, x2, X3) for (x, 7)» 2) and (u;, ug, ug) for (u, v, w). In 
this notation the suffixes (1, 2, 3) refer to the three directions of x, y, 
and z. 

Then 


11 = Ou, /Ox, Y12 = 0u,/Ox, + Oun/Oxy = yoy 
ἔς = Ou2/OX2 Yis = θη χα + Oug/Ox, = yo; 
€33 = OUg/OX, Y23 = Ou2/Ox3 + Oug/Ox_ = Yoo. 
If further the mathematical shear strain is introduced by writing 
12 = 2Vie = 3(6u,/Ox2 + Guz/Ox,) 


er 
13g = 5718 


1 
E23 8723 
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then the six (mathematical) strain components may be written as a 
strain matrix ει)» i,j = 1, 2, 3, Le. 


11 12 &§13 
(δι) = {821 €22  &23 
\€31 €32 838 
in which ¢,; = 3(0u;/@x; + du;/Ex;). 
This particular matrix array (¢;;) has further properties which are of 
importance in the mathematical theory of elasticity. In the form (¢,) 
the above array is known as the strain tensor. 


17.7.2 Fluid dynamics 

A. The material derivative 

Suppose a fluid (e.g. gas or liquid) particle at time fp is at a point Py 
whose position vector is R with reference to a fixed point O. At time ¢ 
suppose the same fluid particle is at the point P whose position vector 
is r. The Fig. 17.45 shows the path of the particle from ἢ to P. Note 
that, for fixed R, the vector r is a function of ¢ only andr = F(£). 


ν 


οὔ 
ot 


Fig. 17.45 


The velocity V of the particle at P is given by 


dr od 
V= “Ξ 7 FC). 
In Cartesian coordinates with O as origin, r = xe; + ye, + Ze; and 
the three components of the velocity (1.6. u, Ὁ, and w) are u = dx/dt; 
v = dy/dt, w = dz/dt. The particle at point P of the fluid has associated 
with it other vector functions such as acceleration and force and certain 
scalar functions such as pressure, density, and temperature. Consider 
first one of these scalar functions, say pressure p. 
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The pressure p at any point P may be considered as a function of 
both its position and of time, e.g. the pressure varies over a region at 
fixed time and also varies over a period of time at a fixed point. Hence 
p may be written p = f(x, y, z, t). However, since r is a function of t only 
it follows that x, y, z, are functions of ¢ only and the above relation- 
ship may be re-written 


ΡΞ ΠΠχ(ἢ, yO), 20), ἢ 
= φ(ὴ 
where ¢ is the function f when expressed in terms of t only. 

It is possible to differentiate p partially with respect to x, y, z and f¢, 
ie. Of/Ox, Offéy, Of]éz, and é@f/ét and it is also possible to differentiate 
p as a function of ¢ only, 1.6. dd/dt. 

An application of equation (27) of Section 5.6.2 in which n = 4, and 
X1, X2, Xg, X4 are replaced by x, y, z, t leads to 

db of dx ofdy . ofdz . ofdt 


dt éx dt" éydt * δα! ot dt 


or in terms of the pressure p 


where u = dx/dt, εἴς. 

It is worth stressing that although the same symbol P appears on 
both sides of this last result the function represented by p is different. 
It is very important to be able to distinguish between ép/ét which 


means 
a 
[3 ω.»..2] 
et X,Y.z fixed 


and dp/dt which means (d/dt)d(t). The term ép/ét on the right side of 
the above result represents the rate of change of pressure with respect 
to time when P is considered fixed, the other terms of the right side 
represent the rate of change of pressure at fixed time ¢ as P moves with 
the fluid. The left side is then the total rate of change of pressure. This 
type of differentiation is often called “differentiation following the 
motion of the fluid” or simply “the material derivative”. 

In exactly similar fashion any of the other scalar or vector functions 
which depend on position and time may be analysed. For example the 
density p may be differentiated to give 


and the velocity V may be differentiated to give 


dV_ ὃν ev ov ev 
di δὲ ποι ape oF 
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This latter result being an expression for the rate of change of velocity 
is simply the acceleration of the fluid particle at the point P at time ἢ 
and naturally plays an important role in the derivation of the equation 
of motion of a fluid. Although the equation of motion will not be 
derived here it is possible to state that an application of Newton’s 
Second Law, that applied force is proportional to rate of change of 
momentum, can lead to an equation of the type 


where R represents all the forces acting on a unit of mass of the fluid 
at P (e.g. body forces due to gravity or spin, pressure forces, viscous 
forces, electromagnetic forces). 

Finally, note that ép/0x, ép/éy, ep/az are the 3 components of grad p 
and that, since (wu, v, w) are the three components of V, 


"Ox + oy ἐδὼ az 
may be written as a scalar product 
V. grad p 
or V. Vp. 


The operational form of the material derivative plays an important 
role in fluid dynamics and has the form 


Ce ee ee 
dt ot Moe gy toe 
0 
or ΞΞ δὲ ἘΝ. grad) 

ὃ 
or =, + .¥). 


B. The Continuity equation 


An important concept in fluid dynamics is that, although the material 
is moving and its volume and density are changing, the mass of any 
given volume is conserved, 1.6. that mass is not annihilated or created. 

It is possible to arrive at the mathematical equation of conservation 
by considering any arbitrary shaped volume in the fluid but the 
mathematical processes involved are outside the scope of this book. 
However the same equation can be obtained for a particular shaped 
volume of fluid, namely a rectangular parallelepiped. The physical 
property of mass conservation is simply a balance equation over the 
rectangular parallelepiped that 


mass flow in — mass flow out = mass accumulation 


and the mathematical statement is a partial differential equation. 
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Suppose P is the point (x, y, z) of Fig. 17.46 and the sides of the 
rectangular volume are parallel to the x, y, and z axes and of lengths 
ὄχ, dy, and dz respectively, e.g. W is the point (x + dx, y + dy, 
z + 6z). At P the fluid velocity V(x, y, z, t) has components (u, v, w) and 


Fig. 17.46 


the density is p(x, y, z, t). Consider the flow in the y-direction across the 
face PORS. The only contribution to this flow info the region is due to 
the velocity component v in the direction y. The other two velocity 
components wu, w being parallel to the face PORS do not contribute 
anything to the flow into the parallelepiped. To the first order of small 
quantities it may be assumed that the velocity component v is constant 
over the face and hence the mass flow into the region in time δὲ is 
(density) x (area) x (velocity) x (time), i.e. 


(φύγει xy,2 ὃχ ὃΖ δὶ 
The flow out of the region over the parallel face UV WT at y + dy is 
(PU)at x,y+dy,2 OX Oz Ot 
The function (pv)x.y+0y.z has a Taylor expansion (using equation (18) of 


Section 5.4 with h = 0, k = dy and inserting a third variable z), i.e. 


ὃ 
(PU) x.y 45y.2 = (pr)at XYZ ΞΕ ὃν By PY) 2 eee 
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Retaining only the first two terms of the expansion the difference (mass 
flow into region over PQRS — mass flow out of region over UV WT) is 
given by 


ὃ 
- By (pv) dx dy 6z Ot 


Calculating the similar results for the other four faces leads to a total 
contribution of (mass flow in — mass flow out) given by 


-ἰξ (pu) + = = (pt) Ἐξ; = (ow) | dx ὃν 6z Ot 


During the time interval t to t + δὲ the density within the volume 
changes from p to p + dp and the mass accumulation is (Density 
increase) X (Volume), i.e. 


Op Ox dy 6z 


(Note this assumes that the density over the region is uniform which 
is strictly not true, but the error is small, if dx, dy, dz are small, and 
decreases to zero as the volume shrinks to zero.) 

Forming the balance equation leads to 


dp dx dy dz = — (5 (pu) += = (60) + 5; = (P")] Ox ὃν dz ot. 


Dividing by the volume element a ὃν 6z) and by δὲ and proceeding 
to the limit so that dx, dy, dz, dt + 0 and remembering that is function 
of (x, y, Z, t) leads to 


ὃ 
ats = (ou) +o = (pt) ἘΞ ° (pw) = =0 


which is the equation of »- or the mass conservation equation. 
An alternative form may be obtained on differentiating the product 
terms and using the material derivative, i.e. 


ep oe ov. ow 
at & uP pote +p eet ap +=)]=0 


ne at + ρ ay ῈΡ oz} 

This differential equation together with the equation of fluid motion 
considered earlier in this Section form two of the most important 
equations of fluid dynamics. 

Finally, it may be noted that in the simple case of liquid flow (e.g. 
hydraulic motion) the density is constant and the continuity equation 
reduces to 

ὃ dv. ow 
ΟΧ ὃν oz 


Answers to Exercises 


EXERCISE 1 (page 17) 
1 44; 0; x + 1/x?; e+ A)? + 1|ἃ + A). 
2 . . 


3 (χε ( Ὁ γ)])( -- γ),χ αὶ πτί,γὶ 
(ὃ x = YG? — 1),x <O0orx> l,y 4 +1; 
Gil) x = -y + V3(4 — y9)], --2 « <2, -2<x <2; 
(iv) x = εν} — 1], x #+1,y SOory>1. 
4 (ἢ continuous; (ii) not continuous; (iii) not continuous; 
(iv) not continuous. 
8 (Ὁ -3 <x <3;(i)x< -3 or x > 3; (iii) -2<x<6 
(iv)O<x <2. 
9 3/2; -- ν,3|3; 2/3/3; —2. 
10 +/3/2; — 1/3; —2; 24/3/3. 
11 (i) σ; (ii) $7; Gi) $n — π᾿ = — do, 
12 (ὃ straight line parallel to the z-axis; 
(ii) straight line 4 units above and parallel to the line y =~ in the xy- 
lane; 
(iii) eee in the plane z = —3 centre (0, 0, —3) radius 3; 
(iv) ellipse centre (0, 0, 0) in the xz-plane, semi-axes a, ὁ. 
13 (i) circle centre in Cartesian coordinates (0, 0, 2), radius 3, in the plane 
z= 2; 
(ii) straight line from the origin in the plane φ = ἐπ making an angle 
45° with the z-axis. 
14 (i) Circle radius 5, centre the origin lying in plane φ = }z; 
(ii) the intersection of the sphere r = 4 and the cone 6 = ἐπ is a circle 
of radius r sin 6 = 24/2; 
(iii) the plane φ = $= and the cone θ = ἔπ intersect in a pair of straight 
lines from the origin; 
(iv) a semi-circle radius 2 in the yz-plane centre (0, 0, 2) on the z-axis. 
15 G)r = 2, p? + 23 = 4; (ii) r = 9 cos 8, p? + 27 = 92; 
(iii) 2? = x? + y?, 9 = bn, 0 = ξπ; 
(iv) x? + y? + 2? = 42, p? + 2? = 4z, 


EXERCISE 2 (page 45) 

1 14;0. 

2 2; —2. 

3 (i) 2 sec*x/(1 — tan x)? or sec%(4a + x); 
(ii) sin x cos x//(1 + sin? x); (iii) cos 2x — 4 sec? x, 

4 y(—2) = 3; when x = 0,γΞ 28, when y = 0,x = —4; point of inflexion 
(—2, 14); y = x? for large x. 

5 sin ¢/(1 + cost). 

6 (1 + 2/0 + OP. 

7 3. 


8 2γ -- 9χ +9 =0. 
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9 (2, 4) and (—2, —4). 
10 n = 2k, 3"(—1)*; n = 2k + 1, 35:3 n(n — 1) — 2)(—1)*-? 
11 maximum (3-7, 6-4) minimum (5-8, 5-0) approximately; common tangent 
y —x + 1=0; curve crosses x-axis between x = 0 and x = {π᾿ 
12 


ς ty faz fyy fay = γα 

1; —x/y? 0 2x/y3 —1/y? 
xx? $ y%) 82 yx? 4 γ 912. (2x2 — γϑ)ϑ Ὁ γ5γ. 512 Cy? — x 2y¢x2 $f y2y~ 5/2 3x yx? Ἐν 55 
cosy—ysinx —xsiny + cosx —ycos x —x cosy —sin y — sinx 


13 6?u/ax? = {n( + 2)x? — nr} /r7*4 

14 Hint: Differentiate each equation implicitly partially with respect to x 
and partially with respect to y. 
—(u + v)/(u? + v4), Qu — 3u/2(u? + υϑ), (ὦ — w)/u? + v), 
(2u + 3v)/2(u? + v?). 


EXERCISE 3 (page 61) 
1 (i) 0; since there exists an interval (—A, ἢ) containing all terms after 


the Nth. 
(ii) no finite limit. 
2 ὦ -3 (vi) sequence oscillates 
(ii) 3 (vii) 0 
(iii) 1 (viii) 0 
(iv) 0 (ix) α, — 20 
(v) oscillates (x) dn —> — 0 


3 (3 — 1/3"-1, C, 3. 
(ii) 410 + 5/(—2)"-4, C, 10/3. 
(iii) 30" — 1), Ὁ. 
ς X(1 + (—1)"*? x") 2 ple es " 
4 (i) ΓΕ" »x ~ —1; (ἃ) x71 + x)”. 
5 @C, -—1 <x <1, Dall other x; 
(ii) C; (iii) C. 
6 ὦ, --1 <x <1, Dall other x; 
(ii) A.C. for all x. 
(iii) A.C. all x. 
7 (ὃ D; Gi) C; (ἡ C. 
8 (i) D; Gi) D. 
9 (i) 27; (ii) 1; (iii) ©. 
11 (i) —1 <x < 1; Gi) 0 only. 


EXERCISE 4 (page 76) 
1. (i) 2x exp (x*); (ii) sec x tan x exp (sec x); 
(iii) 3x? log (1 + x°) + 3χ5 (1 + x9); (iv) 2x[1 + log (1 + x?)]; 
(v) tan-* x; (vi) 1/-/(x? — 3); 
(vii) 4x cosh (2x?); (viii) log x. 
2 Find log y in each case and differentiate implicitly: 
(i) x71 + log x); (ii) (sin x)*[log (sin x) + x cot x]; 
ii) 40./x)7C1 + log x) (iv) xV7(2 + log x)/2./x; 
(v) x*"*[cos x log x + (sin x)/x]; (vi) 10'°**"* cot x log 10. 
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3 log (1 + x) + x/(1 + x); 0; 2; —3. 
4 (ὃ 2A(cosh x cos x — 1)/(sinh x — sin x)?; (ii) —sech x tanh x; 
(iii) tanh7? x; (iv) sech x; (v) 2/sinh 2x. 
5 0<y< 1, minimum at (0, 0), x + +0, y+1-, 
6 —e- [a cosh (pt — bx) + bsinh (pt — bx)]; p e~* sinh (pt — bx). 
7 (i) 2? log 2; (ii) sin 2x exp (sin? x); (iii) (av? log a)/24/x; 
(iv) 3 cosh? x cosh 4x; 
(v) ΥἽἹν = (6x)/(x? + 1) — 2/(2x — 1) — 6/(3x + 2); 
(vi) —2/(1 — x?); (vii) 2x ~ 1) exp (1/x); (viii) a*x* (log a + a/x). 
8 x¥"3(y log x + 1); x*(log x)?. 
9 minimum (0, 0); maximum [4, (4/e)*]; as x > 00, y+. 
10 minimum (0, —3). 
12 (i) exp (x sin x)[x cos x + sin x]; (ii) (log x)?[log (log x) + (log χ)" 1] 
(iii) (1 + x + log x)/(1 + x)?; (iv) —1/[xQog x)?}; (v) sec x. 
17 Qx + γ)α — 2y). 
18 x = 1-70, y = 0:597 or x = —0-597, y = —1-70. 
19 14-11. 
20 —0-405, 0-916. 


EXERCISE 5 (page 98) 


1 Consider the derivative of the difference 1 — e-8 — Be-? and of 
B-1+e8, 

2 -2:5% 

4 3p, + πρα. 

5 (i) 4; (ii) — 4; (iii) 1; Civ) 2; (v) ἃ; (vi) 2. 

6 (ὃ 4; Gi) 0. 

7 (i) ἐ; (ii) 15 Git) 2; Gv) — 1/2; (v) κ. 

9 -- 


οο 
10 Σ (-1)"t1x"/n, --ἰ <x <1, 
aa 


1 
(i) x + x9/3 + x°/5 +. . .; (ii) x + x9/6 + x5/244 0... 
11 x — gx? -+ 4x9 — 2. 1b τ x2/284 xt/4tee 


12 (Ὁ Σ (—1)"x?"+1/(Qn + 1)}, all x; 
n=0 
(ii) > x" sin (6 + 4nz)/n!, all x; 
n=0 
Gi) Σ (cx)"/n!, all x; 
n=0 
(iv) Σ x", |x| <1; 
n=0 
Ὁ) Σ ὧν" |x| <1. 
n=0 
13, © (0511 4x2y"Yn, |x| < V2. 
n=l 
141 +x + 4? -- y?). 
15 (i) minimum (1, —2); (ii) saddle point (1, 2); (iii) maximum (0, 1). 


17 minimum (0, 0); saddle points (+5, +6). 
18 minimum (0, 0); maxima (0, +1). 
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19 minimum (0, 0); saddle points (+1, 0). 
21 —34/9. 

22 saddle point (0, 0); minima (0, +a). 
28 Hint for last part: evaluate 


Laer ὃ 2 τς oz 4 xe é oz oz 
sae er "Oty "Op Ou 3p ago Ont 


29 1. 
30 Slope of tangent is —3. Equation of the tangent is y = (—3)(x — 4). 


EXERCISE 6 (page 123) 
C and A are arbitrary constants of integration. 


1 3a2x7/7 + C. 
2x8 + 2x7 +2xX4+C. 
3 a*x™/7 + fabx* + bx + C. 
4 32,/Qax') + Ὁ. 
5 2Ve™+ C. 
6 arcsin (χ( ν 2) — argsinh (x/+/2) + C. 
7x—cothx+C. 
8 C + 2%e7/(1 + log 2). 
9 C—4V(5 — 4x). 
10 C — a log |a — αἱ. 
11 x + Flog [3x + 11+ C. 
12 4x? +x + 2log|x—1[+C. 
13 4 arctan (3x) + C. 
14 log [1 + cosh x| + C. 
15 VO? + 1ID4+C. 
16 log |tan x| + C. 
17 Ὁ — log |sin x + cos x]. 
18 log |x + cos x| + C. 
19 sin x — }sin?x + C. 
20 C — cos x + 3 cos*® x — ἔ οοβϑ x. 
21 4+sin? x -- ἔϑιηδ χ + C. 
22 Ὁ — 4cos® 4x + 4cos® dx. 
23 C — }cot® x. 
24 {C+ 2x + }sin 2x — ἢ sin 4x — ἢ sin 6x}/32 
25 C —cot x — 4cot® x. 
26 C—x+ tFtan 5x. 
27 C + kos 2x — (cos 8x)/16. 
28 2 log |x + 2| + 5. Ἐ2 6. 
29 } log [A(x + 4)°/x]. 
30 log [A(x? + 4x + 8)] — S{arctan(x + 2)/2}/2. 
31 2/(x? + 4x + 8) — 5 log [( + 2) + VQ? + 4x4 81+ C. 
32 arcsin [(2x + 1)//5] + C. 
33 3 arcsin (2 ~ x) — (4x — x2) - C. 
34 arcsin (2x — 1) 4+ Ὁ. 
35 arctan οὗ + C. 
36 $+/3 arctan [(x + 2)/1/3] + C. 
37 $ log [x?/(1 — x3] + C. 
38 log [A(x + 1)?(x — 3)°]. 
39 log [x/~ + I] -- τὰ ἘΠ) Έ Ο 
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40 tlog [([ + x)/(1 — x)] + darctanx +C. 
41 4 log [A(x + 5) — 1). 

42 C+ 4x? — x + log [x(x + D/& — DI. 
43 } log 2. 

44 9/128. 


47 4. 

48 274/39. 

49 tn — 1. 

50 x arcsinx + νᾷ( — x7) + C. 

51 1χ' arctan x — 4x? + flog (1 + x) +C. 
52 e!"ax — a) + Ὁ. 

53 sinx — xcosx+ C. 

54 x"*1 [log x)/@ + 1) — 1{{πτ-Ὲ 1}}] + C. 
55 # log (4/3). 

56 1(π-- 2). 

57 «7 — 2. 

58 (2 — 3e-")/13. 

59 I, = ex" — πἴ,- ν᾿ εἴ 3 — 3x2 + 6x — 6) - Ὁ. 
60 13/15 — tx. 

61 24(e7 — 1)/85; 3(e7 + 1)/13. 

62 (377 — 16)/64; (607 — 149)/225. 

63 2, 0. 

64 C + [arc cos (+/2/x)]/V/2 if x > 1/2. 
65 C — log (1 + e7?). 

66 (3x? — 2)'°/60 + C. 

67 Rx -- 2να Ἑ 1) Ἐ Ὁ. 

68 log [sinx + νᾷ + sin? x)] + C. 

69 —t. 

70 3π|16α5. 

71 Divergent. 

72 log 2. 

73 2. 

74 Divergent. 

75 Divergent. 

76 π. 

77 t log |(tan 4x + 2)/(tan $x — 2)} + C. 
78 C—x+2/(1 — tan 4x). 

79 C — log |cos x — sin x]. 

80 71/3/9. 


EXERCISE 7 (page 144) 


5 162/15. 

8 72(n? — 6)/48. 
9 4315; 7a?/20. 
10 32/8; 5/6. 

11 16/3; 37/8, 472. 
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12 (22*+23n!)/(2n + 3)2Qn + Qn — 1). . . 3; area 2564/2/105; 
centroid (4/3, 7+/2/16). 

13 32/4; 4/5. 

14 (a/2, + 7/3). 

15 (3at?/5, 3at/4), 27a®r*; 8na®t®/5; t = 5/4. 

16 4M)3. 

17 Distant (2a sin «)/(3x) from the centre along the line of symmetry; 
volume (47a? sin? «)/3. 

18 7Ma?/5. 

19 3Ma?/\0. 

20 x = p = 4a? + ab + b?)/3nla + δ); Χ = Κὶ = 2α{π. 

21 (i) 27a°; (ii) 1670/15; (iii) 8 πα 5. For the area in Ist quadrant. 


EXERCISE 8 (page 182) 


Li=4, 1=25; Tso = darbirCra + AagbonCee + GagbaxCxg and 441D4 Ceo = 
Qa1b11C12 + @41D12Cog + Ggibis¢32 + 4ib14Cag etc. 


2 Ἢ 3 0} (iii) (= = 
1 10 —1 —-3/. 
—1 12 
3. (i) αρμδικ; (ii) aisbye3 (11) (δικ, Bons.» +s θην): 
(iv) (ix, Giz, - - «5 Gin). The kth row, ith column of B’A’ is δικαμ + 
baxQi2 +... et Baring 1.6. ῥικαμ. 


5 PQ = ΑΡιΑ4Π44 = AD,ID2A = A(D,D2)A 
QP = AD,AAD,A = AD2ID,A = A(D2D\)A 
Diagonal matrices commute. 
6 (gb-ah); (ae — db). 
711. 
8 (i) --35 (transposed); (ii) —70 (interchange C2 and C3; --20 9); 
(ili) —35 (Re — Rs); Civ) —35 (Ci — 4Cs); 
(v) 525 (3C2 — 503). 
9 22. 


1 Ι 4 | 
10 =—|4 -3 --ιὸ 
21) -7 6 
11 (i) consistent, indeterminate, x3 = «, 3x2 = 14 — a, 3x, = 2— α. 
12 (i) consistent, r = 3, x4 = —a, 12xg = 4x2 = θα — 6, 4x, = lla + 2; 
(ii) consistent, r = 3, (1, x2, χα) = (—1, 2, 7); 
(iii) inconsistent; 
(iv) consistent, r = 3, x2 = 0, x4 = a, x1 = 6 — 2a, x5 = 3a — 8. 
13 Multiply each equation by x. System of four equations with three un- 
knowns x°, x?, and x. Condition for consistency is 


EXERCISE 9 (page 238) 


1 Weight, tension and momentum are vectors. 
2 No. No. 
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3 Use property that in any triangle ABC 
AB < AC + CB. 


4 AC = 2a + 2b, DB = 2b — 2a, CB = —3a — b, CA = —AC, 
5 (i) FE = = $4C - - 318 Ξε = MAC - AB) = LBC. 
(ii) Use OD = OB+ 486, ὉΕ-οσ. 4CA, OF = OA + 248, 
and AB + BC + ΓΑ =0. 
= (AB + AD) + (CB + CD) = 240 + 2€Q; 
AQ = AP + PO; CO = CP + PO. 
7 3b — 28. 
9 G is centroid of ABC. 
10 (i) 1/101; direction ratios [8: —1:—6] 
direction cosines 37°15’, 95°43’, 126°39’. 
(ii) ν΄ 158; direction ratios [10:7:3]. 
11 GB, --3, -- ἢ 
12 5; 55°33’, 64°54’, 45°. 
13 τ = (2i — 3j + k) + 10Ἱ + k); (2, 27/5, 11/5) or (2, —57/5, —1/5). 
14 (χ — D/-3 = ᾧ — D/-2 = @ — 1/2. 
15 (3/7, 37/7, 2/7); (11, 0, —5). 
16 134i + j + 4k)/+/33. 
17 —7/4/53. 
18 12(4j + 5k)/41. 
19 ν (412). 
20 ν (956). 
21 24 units of work. 
22a+b=c,a.c=0. 
23 Area = |a X δ] = 31/10; (Si — 4] — 7Κ)3 ν 10; 5x — 4y —7z=0. 
24 (—3i + 5j + Hk)/V155; (16i + 25] — 7k)/V/6V/155; 
(—49i + 30j — 27k)/V/26V/155. 
25 —12i + 15j + 9k 
29 . 2, 3). 


32 —2, 3/2. 
33 (2, —1, 0). 
34 Planes have a common line of intersection. 
35 Planes form a prism. 
36 (a x b).a X c= [c X (a X b)] .a and expand the triple vector 
product. — 
37 Write CB =a, CA=b, AB=c, c=a-—b, and form the product 
ς.ς. 
x y zit 
X1 yr 21 Lh eee _— = = 
38 Xe Yo ta 1 = 0; 10x + 9y — 32 -- 17 = 0. 
Χβ 783 23 1 
39 —sin ti — 2 cos 21} + 2tk; —costi+ 4 51η 21} + 2k; 
(cos? t + 16 sin? 2t + 4). 
42 Equation of circle r = 2, r is at right angles to r. 
44 aw? radially towards the centre and ad tangentially in direction of 
increasing 6. 
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45 Radial and transverse components of velocity given by (% cos 6 + » sin 9) 
and (Ὁ cos θ — x sin 6) respectively. Similar expressions for acceleration 
components. _ 

47 + = or cot «; ré = or; constant angle is α. 
f= w*r(cot? α -- 1); f= = 2w?r cot « 

48x =P+isy=2t+2;253-¢+ 05/6. 

49 (ὃ 4 = 3, 4/V17, —1//17)? 

A= 9, ({ν2, —1/2)?. 
(ii) 4 = —1, (0, 1/2, —1/-V2)7; 4 = 0, (//2, —1/+/2, 0)7; 
= 6, (7/+/662, 17/+/662, 18/+/662)7 
50 Characteristic equation A? — 27A? + 180A — 324 = 0; 4 = 18, 6, 3; 
A = 18, (2/3, —2/3, 1/3)7; 4 = 6, (2/3, 1/3, —2/3)7; 
λ = 3, (1/3, 2/3, 2/3)7. 
51 A= —1, —1,2;4= 2, (1, 1, D7; 4 = —1, «(1, —1, 0)7 + £1, 0, —1)7 


Choose two vectors i + j + k and i — j, the third is then j k 
1 1 1 
1 —1 0 


ie, i+ j — 2k. Three mutually orthogonal normalised eigenvectors are 
(1/V3, 1/3, 1/V3)?, A/-v2, 1/2, 0), and (1/6, pea — 2/6)". 
52 (ν(φ): = m = 2xi + 2yj + 2zk; (W4)2 = ne = 2xi + 2yj 
D; . Πα = |n;| [Π2 cos θ, 1.6. cos @ = (4x? + 4y? — ᾿ς μὰς +9) 
when x =1,y= — 2, 2 -- 3. Hence cos θ = 1/4/6. 
53 ¢ = PQ? = (x — 2)? + (y+ 1)? + @ — 1)? 
grad ¢ = 2(x — 2)i + 2(Υ + Dj + 2 — Dk. 


grad ¢ .a ΞΕ (2(x = 2), 2(y + 1), 2(z Se 1)] . [ᾳ, 2, 1)] ὩΣ 416. 
lal VA+4+1 
54 F = v¢ = V (log r) = (wr)/r = r/r?, Work done in small displacement 
is F.dr with dr = —o@ sin ὡδὶ + ὦ cos wsj + ak. Total work done is 


2π|ὼ 
[ Ε. dr. 
0 


EXERCISE 10 (page 271) 
1. (i) —d + ic; Gi) 5 — 3; (iii) —4 + Bi; (iv) —2; (v) (- 11 - 20/125; 
(vi) (—3 + 11/5. 
2 (i) (xt — 6x? y? + y*) + ἰ(άχϑ — 4xy%); 
(ii) [( — x? — y?) + 2 ΙΓ — x)? + 7]; 
(iii) x(2x? + 2y? + 1)/2(x? + y?) + iy(2x? + 2y? — 1)/2(x? + y?). 
= (PC — LQ)/SQC. 
4 (i) 3e"; (ii) 3e'7; (iii) /2e'7'4; (iv) 1/2e9'7'4; (v) le 17/6 
(vi) 16e7'7/3; (vii) /2e-1''; (viii) 16/2e'7'*; (ix) /6e7 9 
where 9 = tan71(2 — 1/3) = 2/12. 
5 νΖ2( -- ὃ; --2΄᾽΄ δὴ 
6 χ-ε —1+42i, x = | (twice). 
7 efietkris), k = Q, 1, 2, 3,4, 5. 
8 2H/2eter/9+2kmI/9) Κι = 0, 1,2. 
9 z= elms + ahem IS). k= 0, 1, 2, 3, 4. 
Evaluate (z — Ζ2)}[(2 --- Ζο)(Ζ -- za) ][(Ζ — Ζ:)(Σ — 2s)]. 
14 Second equation gives x = kw or y= 0; if y = 0, the first equation 
gives cos x = —3 which is impossible; if x = kz, the first equation gives 
cosh y = 3 when k = 2r + 1 (& and r integers). 
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15 (i) line x = 2; (ii) line y = —x; 
(iii) interior of wedge 0 < 6 < 2/3; (iv) interior of circle x? + y? = 1; 
(v) circumference of circle centre (3, 0) radius 2; 
(vi) exterior of circle centre (0, —3) radius 4; 
(vii) circumference of circle x? + y? = 2; 
(viii) circumference of circle (x — 2)? + y? = 2; 
(ix) interior of annulus x? + y?= 1; x2 γῆ = 4 between the radii 
6 = 7/4; θ = π[2; boundary, except line 0 = 7/4, is included. 
16 (i) analytic on line x = 0; 
(ii) analytic everywhere; f(z) = 2 + 33; 


(iii) analytic everywhere except at z = 0; f’(z) = —{cos (1/z)}/z?; 
(iv) not analytic. 
ou ou Ou Ou ou ou 
τρ 4 ou ποθ Me cin g CH Ou 
17 Write a cos 8 ὃχ + sin 0 By’ δῦ rsin ix + rcos 6 By etc. 
and use Cauchy-Riemann equations. 
18 a = 2 τ. 4; ὃ = —1 =c. (Cauchy-Riemann equations to be true every- 
where.) 


19 Real axis y = 0, --οῦ < x. < —1 maps into real axisv = 0, +0 >u>0 
and y = 0, — 1 < x < +0 maps into real axisv = 0,0<u< +o. 
Imaginary axis x = 0, —-0 < y < +9 mapsinto parabola v? = 4(1 — u). 
Semicircle maps the region interior to quadrant of circle 


(x+1?+yY=1,x>0,y>0. 
20 AB: line v = 0 from u = 3 tou = 2. 
BC: circle (u — 3)? + v? = 3, υ <0, fromu = 2 tou = 14. 
AC: circle (u — 2)? + (v — 1)? = 2, υ < 0, fromu = 3 tou = 14, 

21 Equation of curve in w-plane is p = e* where w = pe'?, 

22 v = χϑ — 3xy? + constant; w = iz* + constant. 

23 Slope of curve u(x, y) = constant, is mm, = (—(@u/@x)/(au/éy), and slope 
of curve v(x, y) = constant, is m2 = —(@v/Ax)/(v/éy). Use Cauchy- 
Riemann equations to show that mya = —1. 

(i) 4 = const. are rectangular hyperbolae x? — y? = A? and 
v = const. are also rectangular hyperbolae xy = μ. 
(ii) u = const. are systems of coaxial circles (x — 4)? + γῆ = A? and 
v = const. are also systems of coaxial circles x? + (y — μ)2 = p?, 
(iii) u = const. are systems of hyperbolae x?/sin? u — y?/cos? u = 1 and 
v = const. are systems of ellipses x2/cosh? v + y?/sinh? v = 1, 
24 v=y— x + 3xy?+ C;w=z— iz + iC. 


EXERCISE 11 (page 305) 


A, B, C and Ε are arbitrary constants. 

1 (i) y = Ae*'5; (ii) y = log (A + 67); 
(iii) ἀν 5 + 1) + 3 log Ὁ" + DN =C; 
(iv) 4y? = [3\/(2x) + CP; (v) 2y + sinh 2y = 4 οοβ x + C; 
(vi) 2/4 — y?2) = V(2x7 3 ἘῸ 

2 (i) 1 +e-¥ = Ae; (ii) 3y — 3 log [1 + y| = χϑ + A; 
(iii) e7(1 — e%) = 1 — e; (iv) 4 — γῆ = «( — x)? 

3 @y=& + 1)? — 16 + 1)" 5; (Ὁ y = 3 sin x cos x + A cos? x; 
(iii) y log x = x logx — x + C; (iv) y = 1 + A cos x; 
(v) y = 2x cos x — sin x + A cos x. 
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4 (ii) y = A cosec x + 2 cot x + 2x — x? cot x; 
(iii) y = 4x3 + Ax? 
5 (i) xy = Ae*”; (ii) Ὁ — x)® = AxQy — x); 
(iii) y = Ax(y + x); (iv) xBy? + x7) = Ὁ; 
(v) y = Cx. 
6 (i) 4x? + xy + ἐν = C; (ii) x?eX +7 OX = C; 
(iii) x2y + xy? — 42 = C, 
7 (i) x7 + y? = 1; (ii) y = (arcsin x)? + A arcsin x + 8; 
(iii) κἂν = Β — kx + log (A + e?**). 
8 v2? = u(2kx — 1)/(2k*). 
9 (i) Ae~?* + Be; (ii) A + Be *; 
(iii) (A + Bx)e~?*; (iv) e?7(A cos 3x + Bsin 3x); 
(v) e™*(A cos mx + Bsin mx) + e~™(C cos mx + Esin mx). 
10 ὦ (@) y = 466" — ὁ. 59, (b) y = ἐ(50:: — 6192); 
(ii) (a) y = Axe*, (δ) y = (1 — x)e*. 
11 (ὃ Ox? — 24x + 26)/27; Gi) 4Gx? + 8x). 
12 (ὃ e-*(A cos 2x + Bsin 2x) + (10x + 16)/25; 
(ii) e2*(Ax + B) + fx? + 4x + 3; 
(iti) Ae~? + Be*!S — 2x? + 16x — 84; 
(iv) A cos (χν 3) + Bsin (χν 3) + C + Ex — x9(4 — x?)/36. 
13 (i) Be”; (ii) 3 xe-7; (iii) ἔχϑε"; 
(iv) te-7(2x — 3); (v) (25x? — 20x — 2)e7/125; 
(vi) dxe%*, 
14 (i) ε΄ ΤᾺ cos 2x + Bsin 2x) + {6 Ὁ + 2x); 
(ii) A + Be-?? + e77(8x? — 12x + 7)/64; 
(iii) A + e7(B + Cx — $x? + 3x). 
15 (ὃ —4cosx + 3 sin x; (ii) $x sin 3x; 
(iii) —e* sin x; (iv) —4xe7 cos x; (v) — $x? sin x. 
16 (ὃ Acos 3x + Bsin 3x + #cos 2x; 
(ii) Ae~4** + (4 cos x + sin x)/17; 
(iii) Ae“ 25 + Be~%? + (cos x + sin x)/10 — (5 cos 2x — sin 2x)/52. 
17 (i) x = 2te”'; (ii) x = 2 sin t — sin 21. 
18 (i) y = ἐχο Ὁ — 2); (ἢ y = 46. (1 + x). 
19 (ἢ e7(1 — 4x + 42°); (ii) e? — οἱ 35. 
20 (ὃ 4e-*(x — 1) + 4e°”; 
(it) [e~7(18 cos 3x + 40 sin 3x) + 3 sin 3x — 18 cos 3Χ]111. 
21 (ὃ (Ax + Boe* + x? + 6x? + 18x + 24; 
(ii) e®*(A cos 2x + Bsin 2x) + e(11 cos 3x — 24 sin 3x)/697; 
(iii) (A + Bx)e*; (iv) e-7(4x + B) + x - 1; 
(v) 46. + Be-?* + }e?; 
(vi) (Ax + B) cos 2x + (Cx + E) sin 2x. 
22 (i) Ae? + Be~** — (4cos 2x — 3 sin 2x)/25; 
(ii) Ae~* + Be?*; (iii) A + Be~** — (2 cos 2x + sin 2x)/10; 
(iv) Ae-2#!5 + €7(49x3 — 105x? + 150x + B)/1 029; 
(v) e?7(A cos x + Bsin x) + e%(5x sin x + 10x cos x + 2 sin x + 
14 cas x)/25 + (25x? + 40x + 22)/125. 
23 Isoclines y = (1 — m)x. Solution curves are hyperbolae with asymptotes 
x = Oand y = dx. 
24 Solution obtained from v’x = 1 where v = y/x. Isoclines are straight lines 
y = (μι — 1)x. Solution curve through (1, 0) is y = x log x. As x +0, 
y > 0, y’ > ©; Turning point at (1/e, —1/e); x + οὐ, y> οὐ. 
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25 Isoclines are (x — m/2)? + y? = (m/2)? where m is slope on solution 
curves. These are circles through the origin (0, 0) centres (n/2, 0), radius 
m/2. 


EXERCISE 12 (page 320) 


1 (i) 3χ5 + Sx* + 10x° + 24x? + 48x + 96, remainder 191; 
(ii) x° + 4x?/3 + 4x/9 + 4/27, remainder —23/27. 

2 Function has zero value at x = 1, quotient is 4x3 — 2x? 42x — 1 
which vanishes at x = 4, final quotient is 4x? + 2. Factorization is 
(x — Ι)χ — 12x? + 1). 

3 3x3 + Ἰ|α — 3) — 4/(x — 3)? + 5/(x — 3)4. 

4 (ὃ 4x* + 8x — 3x? — 6x — 21, remainder —24x + 76; 

(ii) 2x* — 3x3 4+ 2x? + 7-5x — 19-25, remainder 65-25x — 6-25. 

5 Roots of given equation are a, β, and y anda + B+ y=0, αβ + ay + 

By = 2/3, «By = 1/3. Roots of new equation are «?, £2, and y? and 


a + B+ γϑ τ (at B+ y)? — 2(.β + ay + py) = —4/3, 
028? + xy? + Py? = (aB + ay + By)? — Ζαβγία + B+ y) = 4/9, 
αὐ B27? = (aby)? = 1/9. 


New equation is 9x3 + 12x27 ++ 4x —1=0. 
6 Roots «, β, and y then « + 6 + y = 5/3, and « + β = 2. Hence yo 
—1/3. Roots are —1/3, 3/2, and 1/2. 
Tx= 35,7 + ivV11)/6. 
8 f(x) has one variation, ie. not more than one positive real root; f(-x) 
has one variation, i.e. not more than one negative real root. But f(-«)> 
0, f(0) < 0, and f(+ 0) > 0 hence there is not less than one positive 
real root and one negative real root. There is exactly one positive and 
one negative real root, and hence two complex roots. 
9 x = 2-10, 0-15, and —3-25. (Check: sum of roots is -- 1). 
10 r° cos® 6 — 3rcos 6 — 1 with r?:3r = 4:3, ie. r= 2 and 4 cos? 6 — 
3 cos 6 = cos 36 = 1/2. 
Roots of reduced equation 
30 = 2ππ:Ὲ 7f3,n=0,1,2,... 
Roots required 
X1 = 2 cos π|9 = 1-879 4 
Χῳ = 2 cos (27/3 + 2/9) = —1-532 1 
Xg = 2 cos (22/3 — π|9) = —0-347 3. 
Other values of repeat the above roots. 
11 cos 36 = 2 with 9 = « + if leads to 


a = 2nn/3, B = (cosh: 2)/3 = 0-439,n=0,1,2,... 


Roots are x, = 2 cos θ = 2(cos « cosh β — isin « sinh β), n = 0, 1, 2. 
X, = 2-196; x2 = —1-098 + 10-785; 
x3 = —1-098 — i0-785. 


EXERCISE 13 (page 352) 


1 0-34. 

2 0-38, —0-65. 
3 0-3376. 

4 —0-374. 
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5 2-646. 
6 Use Newton-Raphson method with f(x) = x? — a, 2-154. 
7 0-125 7. 
8 0-347 3; —1-879 4. 
9 First estimates: x = 4-237, y = —0-853, z = —2-508; 
First residuals: —0-000 2, +0-000 8, —0-000 1. 
Second estimates: x = 4:2369, y = —0-852 7, 2 = —2-5079; 
Second residuals: —0-000 01, —0-000 04, +0-000 04. 
10 (i) x = 4-2369, y = —0-851 5, z = —2-508 2 in six iterations starting 
from (0, 0, 0); 
(ii) x = 4-236 8, y = —0-852 6, z = —2-507 9, correct to four decimal 
places starting from (0, 0, 0). 
11 0-339, —0-041, +0-638, —0-540 correct to three decimal places in four 
iterations starting from (0, 0, 0). 
2x 0 02 04 06 O8 1:0 
y 1 08 0-67 0:58 0-51 0-46 
Solution by separation of variables is γἦ = x + 1 and y(1-0) = 0.5. 
13x 0 ΟἹ 0:2 0-3 0-4 
1 1116 1:275 1-498 1-824 
y(0-4) = 1-82. 
14x 0 02 0-4 06 0.8 10 
y 1 0.856 0-787 0-770 0-791 0-840 
Minimum occurs at x = 0-58, y = 0-76, approximately. 
15 Maximum error 0-04 (approximately). Answer is 1-8 correct to one decimal 
lace. 
16 ‘Absolute error in x is +0-005, absolute error in y is (approx.) +0-0028 | 
and y = 0:2 correct to one decimal place. 


EXERCISE 14 (page 392) 


1 Third differences constant at 0-750. Overlapping errors. Corrected values 
Ye = 438-976, ye = 636-056. 
2 [(0) = 1-40, x, = 0, h=1, fC + ul) = 1-40 + 0-64u + O-16u(u — 1) 
hence 100f(x) = 104 + 42x + 3)?. 
3 Use f(xo + uh) with xo = 0, ἢ = 2 then replace 2u by x, 
8f(x) = 568 + 200x + 24x? + x’. 
4 f(16-4) = 267-6 and f(23:5) = 408-9. 
5 sinh 4-612 = 50:337(6) in each case. Same result because A*y is constant 
over this range and sinh x is represented by a quadratic polynomial. 
6 f(0-3) = 0-295 7 (Newton-Gregory forward); 
55) = 1-415 0 (Newton—-Gregory backward); 
(1-75) = 1-226 8 (Bessel interpolating to halves); 
f(1-55) = 1-152 6 (Stirling using up to fourth differences). 
7 Xo = 40, ἢ = 10, leads to iteration equation 
u = 0-588 235 — 0-103 387 u? + 0-005 348 


and u = 0-556 2, x = 45-562° or 45° 34’. 
8 Γ((4) = 0019 92, f’(12) = ~0-000 04. 
9 Maximum occurs when x = 3.336 and f(x) = 1-226 4. 
10 (i) 0-341 277, error estimate +0-000 27; 
(ii) 0-341 508, error estimate —0-000 007; 
(iii) 0-341 505, error estimate +0-000 001. 
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EXERCISE 15 (page 430) 
τῷ ἐξ ὦ ἢ 

2 ὦ 4; Gd 4; (iii) εἰς; (iv) 4. 

3 (41(481)/(521(134) = 1/270 725. 
4 1023/1 024. 

5 (i) 52/6%; (ii) 55- 3/6". 

6 (i) 1/18; (ii) 1/9; iii) 1/12. 

7 (i) 9/20; (ii) 1/20; (ἰὴ 3. 

8 9/17, 8/17. 


9: 
10 ὦ 1/10; (ii) ὃ; Gii) 3/10; (iv) 4; (v) ὁ. 
11 @) 1; Gi) 1 — οι + x), x > 0; (1) 0-264; (iv) 0-33; (v) 2, 2. 
12 ()c = 1; Gi) x <0, FQ) = 0;0< x <1, FX) = 42; 
1<x <2, F(x) = 2x — 3X? -15x>2, ΕΛ) =1. 
13 2-527, 0-100 4. 
14 (i) mean decreased by 10, standard deviation unchanged; 
(ii) new mean 90% of old mean, new s.d. 90% of old s.d. 
15 (ὃ f = 0-031 25, 0-156 25, 0.312 5; 
(ii) f = 0-001, 0-010, 0-044, 0-117, 0-205, 0-246; 
(iii) zero until r = 5 when f = 0-002, 0-005, 0-014, 0-032, 0-061, 0-098, 
0-133, 0-155. All three histograms symmetrical. 
16 f = 243, 405, 270, 90, 15, 1. 
17 (i) (a) 60-5, (b) 60-7; (ii) (a) 9-0, (b) 9-0. 
18 16-53, 29-75, 26:78, 16-07, 7-23, 2-60. 
19 (i) 0-049 8; (ii) 0-005. 
20 42:8. 
21 (i) 50; (ii) 68-26; (iii) 95. 
22 1002-635, 3-55 P{x < 990} = 0-000 19. 
P{x > 1010} = 0-019. 


EXERCISE 16 (page 460) 
1 (ὃ 0-37; Gi) 0-091 2. 
2 X = 1-997, machine should be adjusted. 
3 |X| = 7-298, mean has changed. 
4 X = 2-67, significant difference, firm’s claim discredited. 
5 2 655. 
6 X = 1-677, there is no reason to doubt that the entry requirements are 
the same; X = 0-833, no conclusion. 
7 Not necessarily, X = 2-236 (i) significant, (ii) not significant. 
8 ¢ = 2-897, significant difference. 
9 X = 1-284, no evidence that men are better drivers. 
10 1 = 2-74, significant, sleeping drug is effective. 
11 ¢ = 1-51, not significant, coaching not effective from data. 
12 ¢ = 2-23, not significant. 
13 ¢ = 0-73, there is no reason to doubt that the rods come from the same 
population. 
14 (i) 0-444, 0-391, 0-138, 0-024, 0-002, 0-000, . . .; 
(ii) 0-472, 0-354, 0-133, 0-033, 0-006, 0-001; 
(iii) B.D. x? = 38-74, P.D. χ' = 30-48, increase is significant in both 
cases. 
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15 x? = 3-56 with 4 degrees of freedom, fit satisfactory. 


16 x? = 46-79 with 6 degrees of freedom, significant, larger canteens produce 
cheaper meals. 


17 χ = 1-87 with 4 degrees of freedom, fit satisfactory. 


18 x? very large with 12 degrees of freedom, manual dexterity and success 
in examinations are not independent. 


Index 


ABSCISSA, 13, 322 
Absolute, convergence, 51, 254, 255 
error, 344 
error modulus, 345 
frequency, 406 
magnitude, 4 
maximum, 90 
value, 4 
A.C. bridge, 509 
Acceleration, 225 
radial, 226 
transverse, 226 
Accuracy, 346 
Addition, formulae, 71, 263, 397 
of complex numbers, 247 
of directed magnitudes, 186 
of matrices, 150 
of vectors, 188 
theorem of statistics, 397 
Adjugate or adjoint matrix, 168 
Admittance, 506 
Algebra, of complex numbers, 247 
of matrices, 150 
of vectors, 188 
Algebraic equations, 146, 174, 178, 
261, 310 
Alternating current, 500 
Alternating series, 58 
Amplitude, 251, 292 
Analogue computer, 491 
Analytic functions, 226 
Angular momentum, 217 
Angular speed, 226 
Approximate, differentiation, 380 
integration, 382 
solution of differential equation, 
338 
solution of equation, 323 
Approximation, 33, 81 
to roots, (Ch. 13), 321, 378 
Arbitrary constant, 102 
Area, as integral, 128 
as limit, 126 
by integration, 108, 109 
definition, 127 
in polar coordinates, 130 
under curve, 109, 129 


Argand diagram, 246, 250, 253, 259, 
260, 266 

Argument, 1, 25, 354 

Array, 147 

Associative law, 151, 155, 185, 192, 
207 

Asymptote, 472 

Attribute, 395, 443 

Augmented matrix, 160, 172, 335 

Axis, of imaginary numbers, 246 

of rea] numbers, 1, 246 


BACKWARD INTERPOLATION, 364 

Balance, equation, 511 
bridge, 510 

Base vector, 496 

Bending of beam, 299 

Bessel, 366, 372 

Bessel’s interpolation formula, 372, 

373, 374, 377, 381, 382 

Binary quadratic, 92 

Binomial, coefficients, 31, 361, 367 
distribution, 417, 422, 427, 444 
frequency function, 417 

Boundary conditions, 478 

Bounded sequence, 49 


CARTESIAN, AXES, 194, 207 
coordinates, 13, 131, 193, 196, 202, 

250 

Catenary, 472 

Cauchy mean value theorem, 81 

Cauchy-Riemann equations, 267 

Capacitative network, 504 

Central limit theorem, 439 

Centre of mass, 134, 135, 136, 137 

Centripetal acceleration, 227 

Centroid, 134, 137, 198 

Chain rule of differentiation, 95 

Chi-square distribution, 451 

Circular functions, 299, 472 

Class interval, 406 

Coefficient matrix, 333, 487 

Co-factor, 162, 163 

Combinations, 399 

Commutative law, 151, 158, 185, 187, 
207, 209, 210, 248 

Comparison test, 55 
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Complementary, event, 400 
function, 287 
Complete primitive, 287 
Complex, current, 501 
functions, 263 
numbers, (Ch. 10) 244, 256, 500 
roots, 258, 261, 319 
variable, (Ch. 10) 263 
voltage, 502 
Component vectors, 192, 193 
Concavity, 34 
Conditional, convergence, 51 
probability, 401 
Conduction, 477 
Confidence interval, 438, 448 
Conic section, 471 
Conjugate complex numbers, 248 
Consistent system, 176 
Contingency table, 454, 455 
Continuity equation, 526 
Continuous, function, 4, 22 
variable 2, 405, 406 
Convection, 477 
Convergence, (Ch. 3), 47, 51, 60, 321, 
327, 329, 335 
absolute, 51 
conditional, 51 
radius of, 60 
tests, 55 
Convergent integral, 121 
Cooling fin, 475 
Coordinates, 13, 14, 15, 16 
Cosine series, 88, 254 
Cramer’s rule, 172, 174, 175 
Critical damping, 290 
Cross product, 209 
Crystal, 496 
axes, 496 
Crystallography, 496 
Current, complex, 501 
cyclic, 479, 492, 509 
matrix, 480 
network, 479, 490, 492, 500 
response, 501 
Curve sketching, 34 
Cyclic current, 479 
Cycloid, 132 
Cylindrical polar coordinates, 14 


D-OPERATOR, 22, 285 
Damped oscillations, 290 
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Damping factor, 290 
Definite integrals, 107 
Degree of differential equation, 274 
Del, V, 235 
De Moivre’s theorem, 256 
Dependent variable, 1 
Derivative, 20, 22, 41 
directional, 233, 236 
material, 524 
of complex function, 266 
of vector, 221, 524 
Descartes rule of signs, 319 
Detached coefficients, 311, 312 
Determinants, (Ch. 8), 148, 160-68, 
487 
Deviation, mean, 411 
standard, 411 
Diagonal, matrix, 158 
terms, 335 
Difference table, 354, 357 
Differences, 355 
Differentiable function, 20, 22 
Differential, 34 
coefficient, 20 
Differential equations, (Ch. 11), 63, 
102, 274, 511 
approximate solution of, 338 
Euler method, 337, 340 
numerical solution of, 336 
Differentiation, (Ch. 2), 20 
chain rule of, 95 
following the motion, 525 
logarithmic, 69 
notation, 21 
numerical, 380 
of complex function, 266 
of exponential function, 67 
of implicit function, 28 
of inverse function, 38, 39, 75 
of logarithmic function, 67 
of power series, 60 
of vector, 221 
parametric, 27 
partial, 41, 93 
rules, 23 
total, 94 
Diminishing of roots, 318, 325 
Directed magnitude, 185 
Direction, angles, 195 
cosines, 195, 482 
ratios, 196, 497 


INDEX 


Directional derivative, 233, 236 
Discrimination of maxima, 36, 37, 92 
Dispersion, 411, 412 
Displacement, 486, 520 
gradient, 520 
in elasticity, 519 
rate of, 520 
vector, 188, 496 
Distribution, binomial, 417, 422, 427, 
444 
Chi-square, 451 
function, 405 
Gaussian, 424 
normal, 424, 427, 434, 436 
of mean, 434 
Poisson, 415, 420, 422, 453 
standardized normal, 425 
student-t, 447 
Distributive law, 155, 185, 192, 207, 
210 
Divided difference, 357 
Divergence, 51, 321, 327, 330, 335 
Divergent, integral, 121 
series, 51 
Division of complex numbers, 249, 
251 
D-notation, 285 
Dot-product, 206 


e, 51, 64 
Eigenvalue, 227, 228, 479, 484, 487 
Eigenvector, 227, 228, 479, 484, 487 
normalised, 229 
orthogonal, 231 
Elasticity, 519 
Electrical network, 146, 299 
Element, 147 
Elementary event, 396 
Ellipse, 471 
Ellipsoid, 133 
Entry, 354 
Equality, of complex numbers, 247 
of matrices, 150 
of vectors, 188 
Error, 81, 322, 333, 344, 357, 381, 
387, 391 
absolute, 345 
in iterative process, 350 
interval, 346 
modulus, 345 
percentage, 346 
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Error—contd. 
propagation of, 346, 358 
relative, 346 
round-off, 344, 378, 391 
Estimator, 434 
Euler form of complex number, 256, 
260 
Euler method, 337 
improved, 340 
Even functions, 9 
Event, 395 
Exact differential equation, 279 
Exclusive events, 395 
Exhaustive events, 396 
Expansion in power series, 86 
Experiment, 395 
Exponential function, (Ch. 4) 63, 65, 
68, 254, 295, 299, 354, 361, 363, 
468 
limits, 66 
series, 64, 88, 254 
Extension of spring, 299, 485 
Extreme value, 91 


FACTORIAL FUNCTION, 363 
Factors, of linear operator, 286 
of polynomial, 313 
Factor theorem, 311 
False position rule, 323 
Filter, 490 
band-pass, 495 
high-pass, 495 
low-pass, 494 
First, difference, 355 
moment, 135, 144, 412 
order differential equations, 275, 
280, 337, 468, 513, 514 
Fluid dynamics, 524 
Forward interpolation, 359, 360 
Frame network, 480 
Freedom equations, 202 
Frequency function, 403 
Function, of complex variable, 263 
of one variable, 1 
of three or more variables, 43 
of two variables, 16, 89 


GAMMA FUNCTION, 363 
Gauss, 172, 333, 366 
interpolation formulae, 368, 371, 
372, 387 
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Gaussian, distribution, 424 
elimination method, 172, 230, 333, 
352, 480 
Gauss—Jacobi method, 335, 336, 352 
Gauss-Seidel method, 336, 352 
Generalized factorial function, 363 
Geometric series, 2, 52 
Gradient, 45 
of scalar field, 232, 526 
Gregory, integration formula, 383 
interpolation formula, 360, 364 


HARMONIC, CONJUGATE FUNCTIONS, 
269 
functions, 269, 273 
series, 53 
Helix, 242 
Higher derivatives, 22, 42 
Histogram, 406 
Homogeneous, differential equation, 
277 
system 175, 177 
Hooke’s law, 485 
Hyperbola, 471 
rectangular, 472 
Hyperbolic functions, (Ch. 4), 70, 
262, 468, 471 
cosine, 471 
sine, 471 


i AS OPERATOR, 244 

Identity matrix, 57 

Ill-conditioned equation, 332, 334 

Imaginary, axis, 244, 503 
part, 247, 297, 510 

Impedance, 502 

Implicit function, 28 

Improper integral, 122 

Improved Euler method, 340 

Inconsistent equations, 175, 213 

Increment, 21, 31, 93 

Indefinite, integral, 102 
quadratic, 92 

Independent variables, | 
function of, 16, 41, 43 

Indeterminate, expression, 82, 175 
solution, 175, 213 

Individual, 395 

Inductive network, 504 

Inequalities, 17, 254 
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Infinite, integrals, 121-123 
limits of integration, 121 
sequence, 47 
series, 51, 52 

Inflexion, 37 

Inflexional tangent, 37 

Initial conditions, 488, 515 

Integral, standard forms, 102 

Integrand, 106, 110 

Integrating factor, 280, 282 

Integration, (Ch. 6) 
applications of, 127 
as sum limit, 126 

Integration, by parts, 115 
by quadrature, 382 
by reduction formula, 117 
by substitution, 119, 120 
graphical, 299 
numerical, 382 
of function with irrational denomi- 

nator, 114 
of power series, 61 
of rational function, 110-114 
rules, 104 

Interpolation, 323, 354, 359, 364, 373 
formulae, 371 
paths, 365, 366, 372 

Interquartile range, 411 

Intersection of planes, 213 

Interval, 1, 438 

Inverse, derivative, 38 
function, 9 
hyperbolic functions, 74, 75 
interpolation, 378 
matrix, 158, 170, 173, 480 
square law, 236 
trigonometric functions, 11, 12, 39 

Irrational number, 2 

Isocline, 299, 305 

Iteration, 321, 326, 329, 334, 335, 378 
error, 350 


jOPERATOR, 244, 501 


Jacobi, 335, 336 
KRONECKER DELTA, 167 
Lac, 504 


Lagrange form of remainder, 86 
Laplace’s equation, 269 
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Latent roots, 228 
Lattice, crystal, 496 
reciprocal, 498 

Lead, 504 


‘Leibnitz’ theorem, 31 


Level, curve, 232 
surface, 235 

lHopital’s rule, 82 

Limit, 4, 48, 66, 82, 126 

Limits of integration, 108, 121 

Linear, algebraic equations, 146 
dependence, 176 
differential equation, 285 
independence, 175, 178 
interpolation, 323, 354 
transformation, 147 
operator, 285 
programming, 464 

Linked, circuit, 492 
network, 485 

Local maximum, 91 

Location of roots, 318, 321 

Logarithmic, differentiation, 69 
function, (Ch. 4) 66, 69 
limits, 66 
series, 87 

Lozenge diagram, 366, 371, 377 


MACLAURIN EXPANSION, 86, 470 
Management problem, 464 
Mapping function, 264 
Mass, balance equation, 513, 514 
conservation, 526 
Matrices, (Ch. 8) 146, 227, 479 
Matrix, algebra, 150, 333 
cost, 467 
form of vector product, 211 
strain, 524 
symmetric, 230 
Maximum, 36, 90, 381 
Mean, 410, 414, 418, 421, 425, 434 
436, 440, 447, 449 
centre, 135 
deviation, 411 
value theorem, (Ch. 5), 79, 80, 81 
Median, 410 
Miller indices, 497, 498, 500 
Minimum, 36, 90, 381 
Minor, 162 
Mixed derivative, 43, 44 
Mode, 410 
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Model, mathematical, 407, 433, 463, 
467 
Modulus, 4, 251 
of elasticity, 485 
Moment, first, 135, 144, 412 
of force, 206, 216 
of inertia, 140 
of momentum, 217 
of vectors, 216 
second, 140, 144, 413 
Monotonic, function, 9, 34 
sequence, 49, 59 
Multiple roots, 311 
Multiplication, of complex numbers, 
248, 251 
of matrices, 149, 152, 480, 484 
theorem of statistics, 402 
Mutually, exhaustive, 396, 397 
exclusive, 395, 397 


Nas A, Y, 235 
Natural, frequency, 488, 493 
growth law, 468 
Negative definitive, 92 
Neighbourhood, 36, 91 
Network, electrical, 479, 490, 500 
frame, 480 
vibration, 484 
Newton-Gregory, 359, 360, 362, 364, 
366, 371, 376, 377, 380, 383 
Newton-Raphson, 326, 328, 331, 332, 
351, 352 
Newton’s, law of cooling, 468, 477 
method, 323, 351 
second law of motion, 276, 485, 
526 
Non-commutative, 154 
Non-singular matrix, 173 
Normal, distribution, 424, 427, 434, 
436 
mode, 486 
to curve, 234 
to plane, 212, 498 
to surface, 235, 242 
Null hypothesis, 436 
Numbers, complex, (Ch. 10), 244 
imaginary, 244, 246 
irrational, 2 
rational, 2 
real, 2, 244, 246 
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Numerical, differentiation, 380 
integration, 382 
solution of differential equation, 
336 


ODD FUNCTION, 9 
Oil purifier, 512 
Operator, 22 
D, 285, 518 
i, 244 
j, 501 
V, 235 
Order, of differential equation, 274 
of matrix, 147 
Ordered array, 147 
Ordinary differential equations, (Ch. 
11) 274 
Ordinate, 13 
Orthogonal, curves, 273 
eigenvector, 231 
matrices, 201 
vectors, 193, 247 
Oscillating, network, 299, 484, 500 
sequence, 49, 327 
Oscillations, damped, 290, 491 
Overdamping, 290 


PAPPUS’ THEOREM, 139 
Parabola, 386, 471 
Parallel axis theorem, 143 
Parallelogram law of addition, 187 
Parameter, 27, 435 
Parametric equations, 202 
Partial, derivative, 41 
differential equation, 269 
differentiation, (Ch. 5), 79, 93 
fractions, 110, 314 
sum, 51, 54 
Particular integral, 287, 292 
Permutations, 398 
Perpendicular axis theorem, 144 
Plane, 212 
Poisson distribution, 420, 422, 453 
Polar, acceleration, 226 
coordinates, 14, 130 
form of complex number, 250 
Pole, 14 
Polynomial, 289, 293, 310, 313, 315, 
316, 354 
approximation, 359, 374, 380, 383 
Population, 395 
growth, 468 


Position, ratio, 204 
vector, 193, 496 
Positive definite, 92 
Power series, 59, 86 
Predictor-corrector method, 341 
Primitive, 102, 287 
Probability, 390, 396, 401, 436 
density function, 407 
Propagation of errors, 346, 358 
Pulsatance, 485 
Purifier, oil, 512 


QUADRATIC FORM, 92 
Quadrature, 382, 383 
Quotient, 310 


RADIUS, OF CONVERGENCE, 60 
of gyration, 140 
Random, sample, 433 
variable, 395, 403, 405 
Range, 1, 411 
Rate of change, 34 
Rate coefficient, 514 
Rational function, 112 
number, 2, 29 
Ratio test, 56 
Reaction constant, 514 
chemical, 514 
first order, 516 
second order, 516 
Real, axis, 246, 503 
number, 2 
part of complex number, 247, 297, 
501, 510 
Reciprocal lattice, 498 
Reciprocal matrix, 158, 170 
Reduction formula, 117 
Redundant equation, 231 
Regular, solution, 174 
system, 171 
Relation between roots of equation, 
317 
Relative, error, 346 
error modulus, 348 
frequency, 406 
maximum, 91 
velocity, 190 
Remainder, 85-92 
theorem, 310 
of series, 54, 86 
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Repeated, differentiation, 30 
division, 314 
Residual, 332, 333, 334 
Resisted motion, 290 
Resonance, 506 
Resultant, 188 
Revolution, solid of, 133 
Rolle’s theorem, 79, 81, 319 
Roots, of algebraic equation, 261, 
311, 316, 318 
of complex number, 258, 260 
Rotating axes, 148 
Round-off errors, 333 
Rule, of false position, 323 
of Sarrus, 164 
of signs, Descartes, 319 
Rules, for differentiation, 23 
for inequalities, 17 
for integration, 104 
for interpolation, 367, 369, 375 


SADDLE POINT, 92 
Sample, 395, 433 
attribute, 443 
mean, 434 
tests, 436, 439, 446 
variance, 436, 451 
Sarrus’ rule, 164 
Scalar, 151, 185 
components, 193 
field, 232 
matrix, 158 
multiplication, 15 
product, 206 
three-dimensional, 235 
two-dimensional, 232 
Second, derivative, 22, 23, 97 
difference, 355 
moment, 140, 144, 413 
order differential equation, 283 
Seidel, 336 
Semi-definite quadratic, 92 
Separation of variables, 276, 517 
Sequence, 47 
Series, (Ch. 3), 51, 59 
Shear strain, 523 
Shortest distance between lines, 215 
Significance tests, 436, 440, 444, 447 
Significant figures, 344 
Simple, harmonic motion, 285 
sample, 433 
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Simpson’s rule, 385, 387, 389, 390, 
426 
Sine, series, 88, 254 
hyperbolic, 471 
Singular, matrix 170, 175 
solution, 175 
Skew lines, 215 
Slope of curve, 21, 302 
Spherical polar coordinates, 15 
Spring system, 485 
Standard, deviation, 411 
error, 435 
integrals, 102 
series, 87 
Stationary point, 38 
Statistic, 434 
Statistics, (Ch. 15), 394, (Ch. 16), 433 
Steady field, 232 
Step function, 8 
Stiffness, 485 
Stirling, 366 
interpolation formula, 371, 372, 
373, 376, 378, 379 
Straight line, 201, 212 
Strain, 520 
tensor, 524 
Student t-test, 447 
Substitution rule for integrals, 119 
Suffix, notation, 147 
summation, 154, 167 
Sum-limit, 51, 128 
Sum to infinity, 51 
Symmetric matrix, 230 
Synthetic division, 311, 312, 315 
System of equations, 172 


t-DISTRIBUTION, 447 

Tangent, to curve, 21, 35 
inflexional, 37 
method, 338 

Taylor series, (Ch. 5), 79, 85, 89, 310, 

311, 314, 328 

Tension coefficient, 482 

Term by term, differentiation, 60, 61 
integration, 61 

Test, Chi-square, 452 
of convergence, 55 
of distribution, 452 
of independence, 454 
of significance, 433, 436, 440 
Student-r, 447 
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Third difference, 355 Variance, 411, 412, 414, 418, 421, 425 
Torsional problem, 299 Variation, 319 
Total differentiation, 94 Vector, (Ch. 9), 188, 496 
Transformation, function, 264 components, 192 
matrix, 148, 199, 227 differentiation, 221, 224 
of derivative, 97 differential equation, 222 
Transposed matrix, 158 equation of straight line, 201 
Trapezoidal rule, 384, 387, 389, 390 in plane, 224 
Trial, 395 of amplitude, 487 
Triangle, inequalities, 190 operator, VY, 235 
of addition, 186 product, 205, 209, 211, 217 
Triangular matrix, 173, 333 quantity, 188 
Trigonometric functions, 262, 354 resolution, 192 
Triple, scalar product, 217 sum, difference, 188, 189 
vector product, 219 Velocity, 225, 226 
Trusses, 481 Vibrational problem, 299, 484 
Turning point, 36, 381 Voltage, lag, 504 
lead, 504 
UNIT, CELL, 497 response, 501 
matrix, 156 Volume, by integration, 132 
vector, 188 element, 133 
Unsteady field, 232 
Upper triangular matrix, 173 WEDDLE’S RULE, 386, 387, 389, 390 
Wien Bridge, 510 
VARIABLES, COMPLEX, 263 Work, 206, 208 
dependent, 1 
function of three, 43, 94 YATES’ CORRECTION, 456 
function of two, 16, 89 
independent, 1 ZERO, MATRIX, 152 


separable, 276, 517 vector, 188 
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